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Foreword 


This document is Version 4.1 of RSA Laboratories’ Frequently Asked Questions About Today’s Cryptography, 
a minor editorial update of Version 4.0 from 1998. Some misprints and errors in Version 4.0 have 
been corrected, and several of the answers have been updated. The labels of the questions, however, 
have not been changed, except that a few new questions have been added and some obsolete 
questions have been removed. Moreover, an Appendix with some mathematical concepts has been 


added. 


The FAQ represents the contributions of numerous individuals. Particular appreciation is due to 
Paul Fahn, who wrote the first and second versions while an RSA Laboratories research assistant in 
1992 and 1993, to Sambasivam Valliappan, who drafted the thitd version as an RSA Laboratories 
research assistant in Summer 1995, and to Moses Liskov and Beverly Schmoock, who were the 
technical editors of Version 4.0. 


Other contributors include Michael S. Baum, Jim Bidzos, John Brainard, Mathew Butler, Victor 
Chang, Scott Contini, Dana Ellingen, James Gray, Stuart Haber, Ari Juels, Burton S. Kaliski, Jr, 
Patrick Lee, John Linn, Paul Livesay, Hoa Ly, Tim Matthews, Bart Preneel, Matthew J.B. Robshaw, 
Raymond M. Sidney, Robert D. Silverman, Jessica Staddon, Jeff Stapleton, Kurt Stammberger, Scott 
Stornetta, and Yiqun Lisa Yin. We add that several people have been involved in reviewing this 
version and its predecessors. 


The technical editor of Version 4.1 is Jakob Jonsson, RSA Laboratories Europe. 


Comments on the FAQ are encouraged. Address correspondence to: 


FAQ Editor 

RSA Laboratories 

20 Crosby Drive 
Bedford, MA 01730 USA 


Phone: +1 781 687 7000 
Fax : +1 781 687 7213 
e-mail: [Faqrediterdrsasecurity coq. 


CHAPTER 7 
Introduction 


In this introductory chapter, a brief overview of the field of cryptography and related issues is given. 


1.1 Whatis RSA Laboratories’ Frequently Asked Questions About 
Today’s Cryptography? 


RSA Laboratories’ Frequently Asked Questions About Today's Cryptography is a large collection of questions 
about modern cryptography, cryptanalysis, and issues related to them. The information is presented 
in question and answer form. We have not attempted to be, nor could we be, exhaustive in answering 
every possible question. Yet, we hope that this document will be both a useful introductory text and 
a useful reference for those interested in the field of cryptography. 
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1.2 Whatis cryptography? 


As the field of cryptography has advanced, the dividing lines for what is and what is not cryptography 
have become blurred. Cryptography today might be summed up as the study of techniques and 
applications that depend on the existence of difficult problems. Cryptanalysis is the study of how to 
compromise (defeat) cryptographic mechanisms, and cryptology (from the Greek &ryptos logos, meaning 
“hidden word”) is the discipline of cryptography and cryptanalysis combined. To most people, 
cryptography is concerned with keeping communications private. Indeed, the protection of sensitive 
communications has been the emphasis of cryptography throughout much of its history [Kah67]. 
However, this is only one part of today’s cryptography. 


Encryption is the transformation of data into a form that is as close to impossible as possible to read 
without the appropriate knowledge (a key; see below). Its purpose is to ensure privacy by keeping 
information hidden from anyone for whom it is not intended, even those who have access to the 
encrypted data. Decryption is the reverse of encryption; it is the transformation of encrypted data back 
into an intelligible form. 


Encryption and decryption generally require the use of some secret information, referred to as a key. 
For some encryption mechanisms, the same key is used for both encryption and decryption; for 
other mechanisms, the keys used for encryption and decryption are different (see Question 2.1.1). 


Today's cryptography is more than encryption and decryption. Authentication is as fundamentally a 
part of our lives as privacy. We use authentication throughout our everyday lives -- when we sign 
our name to some document for instance -- and, as we move to a world where our decisions and 
agreements ate communicated electronically, we need to have electronic techniques for providing 
authentication. 


Cryptography provides mechanisms for such procedures. A digital signature (see Question binds 
a document to the possessor of a particular key, while a digital timestamp (see Question [7.TT) binds 
a document to its creation at a particular time. These cryptographic mechanisms can be used to 
control access to a shared disk drive, a high security installation, or a pay-per-view TV channel. 


The field of cryptography encompasses other uses as well. With just a few basic cryptographic tools, 
it is possible to build elaborate schemes and protocols that allow us to pay using electronic money 
(see Question 4.2.1), to prove we know certain information without revealing the information itself 
(see Question 2.1.8), and to share a secret quantity in such a way that a subset of the shares can 
reconstruct the secret (see Question 2.1.9). 


While modern cryptography is growing increasingly diverse, cryptography is fundamentally based on 
problems that are difficult to solve. A problem may be difficult because its solution requires some 
secret knowledge, such as decrypting an encrypted message or signing some digital document. The 
problem may also be hard because it is intrinsically difficult to complete, such as finding a message 
that produces a given hash value (see Question 2.1.0). 


Surveys by Rivest [Riv90] and Brassard [Bragð] form an excellent introduction to modern cryptogra- 


phy. Some textbook treatments are provided by Stinson [Sti95] and Stallings [Sta95], while Simmons 
provides an in-depth coverage of the technical aspects of cryptography [Sim92]. A comprehensive 
review of modern cryptography can also be found in Applied Cryptography [Sch9G]; Ford [Eor94] 
provides detailed coverage of issues such as cryptography standards and secure communication. 
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1.3 What are some of the more popular techniques in cryptography? 


There are two types of cryptosystems: secret-key and public-key cryptography (see Questions and 
2.1.1). In secret-key cryptography, also referred to as symmetric cryptography, the same key is used 
for both encryption and decryption. The most popular secret-key cryptosystem in use today is the 
Data Encryption Standard (DES; see Section B.2). 


In public-key cryptography, each user has a public Rey and a private key. The public key is made public 
while the private key remains secret. Encryption is performed with the public key while decryption is 
done with the private key. The RSA public-key cryptosystem (see Section B.T) is the most popular form 
of public-key cryptography. RSA stands for Rivest, Shamir, and Adleman, the inventors of the RSA 
ctyptosystem. 


The Digital Signature Algorithm (DSA; see Section B.4) is also a popular public-key technique, though it 
can only be used only for signatures, not encryption. Elliptic curve cryptosystems (ECCs; see Section B.5) 
ate cryptosystems based on mathematical objects known as elliptic curves (see Question 2.3.10). 
Elliptic curve cryptography has been gaining in popularity recently. Lastly, the Diffie-Hellman key 
agreement protocol (see Question B.6.1) is a popular public-key technique for establishing secret keys 
over an insecure channel. 


1.4 How is cryptography applied? 


Cryptography is extremely useful; there is a multitude of applications, many of which are currently 
in use. A typical application of cryptography is a system built out of the basic techniques. Such 
systems can be of various levels of complexity. Some of the more simple applications are secure 
communication, identification, authentication, and secret sharing. More complicated applications 
include systems for electronic commerce, certification, secure electronic mail, key recovery, and 
secure computer access. 

In general, the less complex the application, the more quickly it becomes a reality. Identification 
and authentication schemes exist widely, while electronic commerce systems are just beginning to be 
established. However, there are exceptions to this rule; namely, the adoption rate may depend on 
the level of demand. For example, SSL-encapsulated HTTP (see Question 6.1.2) gained a lot more 
usage much more quickly than simpler link-layer encryption has ever achieved. The adoption rate 
may depend on the level of demand. 


Secure Communication 

Secure communication is the most straightforward use of cryptography. Two people may com- 
municate securely by encrypting the messages sent between them. This can be done in such a 
way that a third party eavesdropping may never be able to decipher the messages. While secure 
communication has existed for centuries, the key management problem has prevented it from 
becoming commonplace. Thanks to the development of public-key cryptography, the tools exist to 
create a large-scale network of people who can communicate securely with one another even if they 
had never communicated before. 


Identification and Authentication 

Identification and authentication are two widely used applications of cryptography. Identification is 
the process of verifying someone’s or something’s identity. For example, when withdrawing money 
from a bank, a teller asks to see identification (for example, a driver’s license) to verify the identity of 
the owner of the account. This same process can be done electronically using cryptography. Every 
automatic teller machine (ATM) card is associated with a “secret” personal identification number 
(PIN), which binds the owner to the catd and thus to the account. When the card is inserted into the 
ATM, the machine prompts the cardholder for the PIN. If the correct PIN is entered, the machine 
identifies that person as the rightful owner and grants access. Another important application of 
ctyptography is authentication. Authentication is similar to identification, in that both allow an entity 
access to resources (such as an Internet account), but authentication is broader because it does not 
necessarily involve identifying a person or entity. Authentication merely determines whether that 
person or entity is authorized for whatever is in question. For more information on authentication 
and identification, see Question 2.2.5] 


Secret Sharing 

Another application of cryptography, called secret sharing, allows the trust of a secret to be distributed 
among a group of people. For example, in a (k, n)-threshold scheme, information about a secret is 
distributed in such a way that any k out of the n people (k < n) have enough information to determine 
the secret, but any set of k — 1 people do not. In any secret sharing scheme, there are designated sets 
of people whose cumulative information suffices to determine the secret. In some implementations 
of secret sharing schemes, each participant receives the secret after it has been generated. In other 
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implementations, the actual secret is never made visible to the participants, although the purpose for 
which they sought the secret (for example, access to a building or permission to execute a process) is 
allowed. See Question 2.1.9 for more information on secret sharing. 


Electronic Commerce 

Over the past few years there has been a growing amount of business conducted over the Internet 
-- this form of business is called electronic commerce or e-commerce. E-commerce is comprised 
of online banking, online brokerage accounts, and Internet shopping, to name a few of the many 
applications. One can book plane tickets, make hotel reservations, rent a car, transfer money from 
one account to another, buy compact disks (CDs), clothes, books and so on all while sitting in 
front of a computer. However, simply entering a credit card number on the Internet leaves one 
open to fraud. One cryptographic solution to this problem is to encrypt the credit card number 
(or other private information) when it is entered online, another is to secure the entire session (see 
Question 6.1.2). When a computer encrypts this information and sends it out on the Internet, it 
is incomprehensible to a third party viewer. The web server (“Internet shopping center”) receives 
the encrypted information, decrypts it, and proceeds with the sale without fear that the credit card 
number (or other personal information) slipped into the wrong hands. As more and more business 
is conducted over the Internet, the need for protection against fraud, theft, and corruption of vital 


information increases. 


Certification 

Another application of cryptography is certification; certification is a scheme by which trusted agents 
such as certifying authorities vouch for unknown agents, such as users. The trusted agents issue 
vouchers called certificates which each have some inherent meaning. Certification technology was 
developed to make identification and authentication possible on a large scale. See Question f.1.3.1] 
for more information on certification. 


Key Recovery 

Key recovery is a technology that allows a key to be revealed under certain circumstances without 
the owner of the key revealing it. This is useful for two main reasons: first of all, if a user loses 
or accidentally deletes his or her key, key recovery could prevent a disaster. Secondly, if a law 
enforcement agency wishes to eavesdrop on a suspected criminal without the suspect’s knowledge 
(akin to a wiretap), the agency must be able to recover the key. Key recovery techniques are in use 
in some instances; however, the use of key recovery as a law enforcement technique is somewhat 
controversial. See Question for more on key recovery. 


Remote Access 

Secure remote access is another important application of cryptography. The basic system of 
passwords certainly gives a level of security for secure access, but it may not be enough in some 
cases. For instance, passwords can be eavesdropped, forgotten, stolen, or guessed. Many products 
supply cryptographic methods for remote access with a higher degree of security. 


Other Applications 

Cryptography is not confined to the world of computers. Cryptography is also used in cellular 
(mobile) phones as a means of authentication; that is, it can be used to verify that a particular phone 
has the right to bill to a particular phone number. This prevents people from stealing (“cloning”) 


cellular phone numbers and access codes. Another application is to protect phone calls from 
eavesdropping using voice encryption. 
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1.5 Whatare cryptography standards? 


Cryptography standards are needed to create interoperability in the information security wotld. 
Essentially they are conditions and protocols set forth to allow uniformity within communication, 
transactions and virtually all computer activity. The continual evolution of information technology 
motivates the development of more standards, which in turn helps guide this evolution. 


The main motivation behind standards is to allow technology from different manufacturers to “speak 
the same language”, that is, to interact effectively. Perhaps this is best seen in the familiar standard 
VHS for video cassette recorders (VCRs). A few years ago there were two competing standards in 
the VCR industry, VHS and BETA. A VHS tape could not be played in a BETA machine and vice 
versa; they were incompatible formats. Imagine the chaos if all VCR manufacturers had different 
formats. People could only rent movies that were available on the format compatible with their VCR. 
Standards are necessary to insure that products from different companies are compatible. 


In cryptography, standardization serves an additional purpose; it can serve as a proving ground for 
ctyptographic techniques because complex protocols are prone to design flaws. By establishing a 
well-examined standard, the industry can produce a more trustworthy product. Even a safe protocol 
is more trusted by customers after it becomes a standard, because of the ratification process involved. 


The government, private industry, and other organizations contribute to the vast collection of 
standards on cryptography. A few of these are ISO, ANSI, IEEE, NIST, and IETF (see Section 6.3). 
There are many types of standards, some used within the banking industry, some internationally 
and others within the government. Standardization helps developers design new products. Instead 
of spending time developing a new standard, they can follow a pre-existing standard throughout 
the development process. With this process in place consumers have the chance to choose among 
competing products or services. 


1.6 What is the role of the United States government in cryptography? 


The U.S. government plays many roles in cryptography, ranging from use to export control to 
standardization efforts to the development of new cryptosystems. Recently the government has 
taken an even bigger interest in cryptography due to its ever-increasing use outside of the military. 


One obvious reason the U.S. government is interested in cryptography stems from the crucial role of 
secure communication during wartime. Because the enemy may have access to the communication 
medium, messages must be encrypted. With certain cryptosystems, the receiver can determine 
whether or not the message was tampered with during transmission, and whether the message really 
came from who claims to have sent it. 


In the past, the government has not only used cryptography itself, but has cracked other country’s 
codes as well. A notable example of this occurred in 1940 when a group of Navy cryptanalysts, led 
by William F. Friedman, succeeded in breaking the Japanese diplomatic cipher known as Purple. 


In 1952, the U.S. government established The National Security Agency (NSA; see Question (6.2.2), 
whose job is to handle military and government data security as well as gather information about 
other countries’ communications. Also established was The National Institute of Standards and Technology 
(NIST; see Question 6.2.1), which plays a major role in developing cryptography standards. 


During the 1970’s, IBM and the U.S. Department of Commerce -- more precisely NIST (then known 
as NBS) -- developed along with NSA the Data Encryption Standard (DES; see Section B.2). This 
algorithm has been a standard since 1977, with reviews leading to renewals every few years. The 
general consensus is that DES is no longer strong enough for today’s encryption needs. Therefore, 
NIST is currently working on a new standard, the Advanced Encryption Standard (AES; see Section B.3), 
to replace DES. In the intermediate stage, triple-DES (see Question B.2.0) is the encryption standard. 
It is expected that AES will remain a standard well into the 21st century. 


Currently there are no restrictions on the use or strength of domestic encryption (encryption 
where the sender and recipient are in the U.S.). However, the government regulates the export of 
cryptography from the U.S. by setting restrictions (see Section on how strong such encryption 
may be. Cryptographic exports are controlled under the Export Administration Regulations (EAR), 
and their treatment varies according to several factors including destinations, customers, and the 
strength and usage of the cryptography involved. In January 2000, the restrictions were significantly 
relaxed; today, any cryptographic product can be exported to non-governmental end-users outside 
embargoed destinations (states supporting terrorism) without a license. 
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1.7 Whyis cryptography important? 


Cryptography allows people to carry over the confidence found in the physical world to the electronic 
world, thus allowing people to do business electronically without worries of deceit and deception. 
Every day hundreds of thousands of people interact electronically, whether it is through e-mail, e- 
commerce (business conducted over the Internet), ATM machines, or cellular phones. The perpetual 
increase of information transmitted electronically has lead to an increased reliance on cryptography. 


Cryptography on the Internet 

The Internet, comprised of millions of interconnected computers, allows nearly instantaneous 
communication and transfer of information, around the world. People use e-mail to correspond 
with one another. The World Wide Web is used for online business, data distribution, marketing, 
research, learning, and a myriad of other activities. 


Cryptography makes secure web sites (see Question and electronic safe transmissions possible. 
For a web site to be secure all of the data transmitted between the computers where the data is 
kept and where it is received must be encrypted. This allows people to do online banking, online 
trading, and make online purchases with their credit cards, without worrying that any of their account 
information is being compromised. Cryptography is very important to the continued growth of the 
Internet and electronic commerce. 


E-commerce (see Section 4.) is increasing at a very rapid rate. By the turn of the century, commercial 
transactions on the Internet are expected to total hundreds of billions of dollars a year. This level 
of activity could not be supported without cryptographic security. It has been said that one is safer 
using a credit card over the Internet than within a store or restaurant. It requires much more work 
to seize credit card numbers over computer networks than it does to simply walk by a table in a 
restaurant and lay hold of a credit card receipt. These levels of security, though not yet widely used, 
give the means to strengthen the foundation with which e-commerce can grow. 


People use e-mail to conduct personal and business matters on a daily basis. E-mail has no physical 
form and may exist electronically in more than one place at a time. This poses a potential problem 
as it increases the opportunity for an eavesdropper to get a hold of the transmission. Encryption 
protects e-mail by rendering it very difficult to read by any unintended party. Digital signatures can 
also be used to authenticate the origin and the content of an e-mail message. 


Authentication 

In some cases cryptography allows you to have more confidence in your electronic transactions than 
you do in real life transactions. For example, signing documents in real life still leaves one vulnerable 
to the following scenario. After signing your will, agreeing to what is put forth in the document, 
someone can change that document and your signature is still attached. In the electronic world this 
type of falsification is much more difficult because digital signatures (see Question are built 
using the contents of the document being signed. 


Access Control 
Cryptography is also used to regulate access to satellite and cable TV. Cable TV is set up so people 
can watch only the channels they pay for. Since there is a direct line from the cable company to 


each individual subscriber”s home, the Cable Company will only send those channels that are paid 
for. Many companies offer pay-per-view channels to their subscribers. Pay-per-view cable allows 
cable subscribers to “rent” a movie directly through the cable box. What the cable box does is 
decode the incoming movie, but not until the movie has been “rented.” If a person wants to watch a 
pay-per-view movie, he/she calls the cable company and requests it. In return, the Cable Company 
sends out a signal to the subscriber’s cable box, which unscrambles (decrypts) the requested movie. 


Satellite TV works slightly differently since the satellite TV companies do not have a direct connection 
to each individual subscriber's home. This means that anyone with a satellite dish can pick up the 
signals. To alleviate the problem of people getting free TV, they use cryptography. The trick is to 
allow only those who have paid for their service to unscramble the transmission; this is done with 
receivers (“unscramblers”). Each subscriber is given a receiver; the satellite transmits signals that can 
only be unscrambled by such a receiver (ideally). Pay-per-view works in essentially the same way as it 
does for regular cable TV. 


As seen, cryptography is widely used. Not only is it used over the Internet, but also it is used in 
phones, televisions, and a variety of other common household items. Without cryptography, hackers 
could get into our e-mail, listen in on our phone conversations, tap into our cable companies and 
acquire free cable service, or break into our bank/brokerage accounts. 
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CHAPTER 2 
Cryptography 


This chapter expands on the overview of the field of cryptography and related issues, providing more 
detail about the concepts involved in cryptography. It lays the conceptual groundwork for the next 
chapter. 


2.1 CRYPTOGRAPHIC TOOLS 
2.1.1 Whatis public-key cryptography? 


In traditional cryptography, the sender and receiver of a message know and use the same secret 
key; the sender uses the secret key to encrypt the message, and the receiver uses the same secret 
key to decrypt the message. This method is known as secret key or symmetric cryptography (see 
Question 2.1.2). The main challenge is getting the sender and receiver to agree on the secret key 
without anyone else finding out. If they are in separate physical locations, they must trust a courier, 
a phone system, or some other transmission medium to prevent the disclosure of the secret key. 
Anyone who overhears or intercepts the key in transit can later read, modify, and forge all messages 
encrypted or authenticated using that key. The generation, transmission and storage of keys is called 
key management (see Section AT); all cryptosystems must deal with key management issues. Because 
all keys in a secret-key cryptosystem must remain secret, secret-key cryptography often has difficulty 
providing secure key management, especially in open systems with a large number of users. 


In order to solve the key management problem, Whitfield Diffie and Martin Hellman [DHZ6] 
introduced the concept of public-key cryptography in 1976. Public-key cryptosystems have two 
primary uses, encryption and digital signatures. In their system, each person gets a pair of keys, one 
called the public key and the other called the private key. The public key is published, while the private 
key is kept secret. The need for the sender and receiver to share secret information is eliminated; all 
communications involve only public keys, and no private key is ever transmitted or shared. In this 
system, it is no longer necessary to trust the security of some means of communications. The only 
requirement is that public keys be associated with their users in a trusted (authenticated) manner 
(for instance, in a trusted directory). Anyone can send a confidential message by just using public 
information, but the message can only be decrypted with a private key, which is in the sole possession 
of the intended recipient. Furthermore, public-key cryptography can be used not only for privacy 
(encryption), but also for authentication (digital signatures) and other various techniques. 


In a public-key cryptosystem, the private key is always linked mathematically to the public key. 
Therefore, it is always possible to attack a public-key system by deriving the private key from the 
public key. Typically, the defense against this is to make the problem of deriving the private key from 
the public key as difficult as possible. For instance, some public-key cryptosystems are designed such 


that deriving the private key from the public key requires the attacker to factor a large number, it 
this case it is computationally infeasible to perform the derivation. This is the idea behind the RSA 
public-key cryptosystem. 


Encryption 

When Alice wishes to send a secret message to Bob, she looks up Bob's public key in a directory, 
uses it to encrypt the message and sends it off. Bob then uses his private key to decrypt the message 
and read it. No one listening in can decrypt the message. Anyone can send an encrypted message to 
Bob, but only Bob can read it (because only Bob knows Bob’s private key). 


Digital Signatures 

To sign a message, Alice does a computation involving both her private key and the message itself. 
The output is called a digital signature and is attached to the message. To verify the signature, Bob 
does a computation involving the message, the purported signature, and Alice’s public key. If the 
result is correct according to a simple, prescribed mathematical relation, the signature is verified to 
be genuine; otherwise, the signature is fraudulent, or the message may have been altered. 


A good history of public-key cryptography is given by Diffie [Dif88]. 
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2.1.2 Whatis secret-key cryptography? 


Secret-key cryptography is sometimes referred to as symmetric cryptography. It is the more traditional 
form of cryptography, in which a single key can be used to encrypt and decrypt a message. Secret-key 
ctyptography not only deals with encryption, but it also deals with authentication. One such technique 
is called message authentication codes (MACs; see Question 2.1.7). 


The main problem with secret-key cryptosystems is getting the sender and receiver to agree on the 
secret key without anyone else finding out. This requires a method by which the two parties can 
communicate without fear of eavesdropping. However, the advantage of secret-key cryptography is 
that it is generally faster than public-key cryptography. 


The most common techniques in secret-key cryptography ate block ciphers (see Question 2.1.4), stream 
ciphers (see Question 2.1.5), and message authentication codes. 


2.1.3 What are the advantages and disadvantages of public-key cryptography 


compared with secret-key cryptography? 


The primary advantage of public-key cryptography is increased security and convenience: private 
keys never need to be transmitted or revealed to anyone. In a secret-key system, by contrast, the 
secret keys must be transmitted (either manually or through a communication channel) since the 
same key is used for encryption and decryption. A serious concern is that there may be a chance that 
an enemy can discover the secret key during transmission. 


Another major advantage of public-key systems is that they can provide digital signatures that 
cannot be repudiated. Authentication via secret-key systems requires the sharing of some secret 
and sometimes requires trust of a third party as well. As a result, a sender can repudiate a 
previously authenticated message by claiming the shared secret was somehow compromised (see 
Question by one of the parties sharing the secret. For example, the Kerberos secret-key 
authentication system (see Question 6.1.q) involves a central database that keeps copies of the secret 
keys of all users; an attack on the database would allow widespread forgery. Public-key authentication, 
on the other hand, prevents this type of repudiation; each user has sole responsibility for protecting 
his or her private key. This property of public-key authentication is often called non-repudiation. 


A disadvantage of using public-key cryptography for encryption is speed. There are many secret-key 
encryption methods that are significantly faster than any currently available public-key encryption 
method. Nevertheless, public-key cryptography can be used with secret-key cryptography to get the 
best of both worlds. For encryption, the best solution is to combine public- and secret-key systems 
in order to get both the security advantages of public-key systems and the speed advantages of 
sectet-key systems. Such a protocol is called a digital envelope, which is explained in more detail in 


Question 2.2.4] 


Public-key cryptography may be vulnerable to impersonation, even if users” private keys are not 
available. A successful attack on a certification authority (see Question H.1.3.14) will allow an 
adversary to impersonate whomever he or she chooses by using a public-key certificate from the 
compromised authority to bind a key of the adversary's choice to the name of another user. 


In some situations, public-key cryptography is not necessary and secret-key cryptography alone is 
sufficient. These include environments where secure secret key distribution can take place, for 
example, by users meeting in private. It also includes environments where a single authority knows 
and manages all the keys, for example, a closed banking system. Since the authority knows everyone’s 
keys already, there is not much advantage for some to be “public” and others to be “private.” Note, 
however, that such a system may become impractical if the number of users becomes large; there are 
not necessarily any such limitations in a public-key system. 


Public-key cryptography is usually not necessary in a single-user environment. For example, if you 
want to keep your personal files encrypted, you can do so with any secret key encryption algorithm 
using, say, your personal password as the secret key. In general, public-key cryptography is best 
suited for an open multi-user environment. 
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Public-key cryptography is not meant to replace secret-key cryptography, but rather to supplement 
it, to make it more secure. The first use of public-key techniques was for secure key establishment 
in a secret-key system [DH70]; this is still one of its primary functions. Secret-key cryptography 
remains extremely important and is the subject of much ongoing study and research. Some secret-key 
cryptosystems are discussed in the sections on block ciphers and stream ciphers. 


2.1.4 Whatis a block cipher? 


A block cipher is a type of symmetric-key encryption algorithm that transforms a fixed-length block 
of plaintext (unencrypted text) data into a block of ciphertext (encrypted text) data of the same length. 
This transformation takes place under the action of a user-provided secret key. Decryption is 
performed by applying the reverse transformation to the ciphertext block using the same secret key. 
The fixed length is called the block size, and for many block ciphers, the block size is 64 bits. In the 
coming years the block size will increase to 128 bits as processors become more sophisticated. 


For those with a mathematical background (see Appendix [A): Since different plaintext blocks 
are mapped to different ciphertext blocks (to allow unique decryption), a block cipher effectively 
provides a permutation (one to one reversible correspondence) of the set of all possible messages. 
The permutation effected during any particular encryption is of course secret, since it is a function 
of the secret key. 


When we use a block cipher to encrypt a message of arbitrary length, we use techniques known 
as modes of operation for the block cipher. To be useful, a mode must be at least as secure and 
as efficient as the underlying cipher. Modes may have properties in addition to those inherent in 
the basic cipher. The standard DES modes (see Question B.2.3) have been published in FIPS 81 
[NIS80] and as ANSI X3.106 [ANS83]. A more general version of the standard [[SO925] generalized 
the four modes of DES to be applicable to a block cipher of any block size. The standard modes 
ate Electronic Code Book (Question 2.1.4.2), Cipher Block Chaining (Question 2.1.4.2), Cipher Feedback 


(Question 2.1.4.4), and Output Feedback (Question 2.1.4.5). 


More information about block ciphers and the various available algorithms can be found in almost 
any book on contemporary cryptography. 
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2.1.4.1 Whatis an iterated block cipher? 


Iterated block ciphers encrypt a plaintext block by a process that has several rounds. In each round, 
the same transformation (also known as a round function) is applied to the data using a subkey. The 
set of subkeys is usually derived from the user-provided secret key by a special function. The set of 
subkeys is called the ey schedule. The number of rounds in an iterated cipher depends on the desired 
security level and the consequent tradeoff with performance. In most cases, an increased number 
of rounds will improve the security offered by a block cipher, but for some ciphers the number 
of rounds required to achieve adequate security will be too large for the cipher to be practical or 
desirable. 


Feistel ciphers \Eet73] are a special class of iterated block ciphers where the ciphertext is calculated 
from the plaintext by repeated application of the same transformation or round function. Feistel 


ciphers are sometimes called DES-Zke ciphers (see Section B.2). 
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Figure 2.1: Feistel Cipher. 


In a Feistel cipher (see Figure 2.1.4.1), the text being encrypted is split into two halves. The round 
function f is applied to one half using a subkey and the output of f is XORed with the other half. 
The two halves are then swapped. Each round follows the same pattern except for the last round 
where there is no swap. 


A nice feature of a Feistel cipher is that encryption and decryption are structurally identical, though 
the subkeys used during encryption at each round are taken in reverse order during decryption. 
More precisely, the input in the decryption algorithm is the pair (Rr, Lr) instead of the pair (Lo, Ro) 
(notations as in Figure 2.1.4.1), and the ith subkey is k,_;41, not ki. This means that we obtain 
(Rr—i, Lr—i) instead of (L;, Ri) after the ith round. For example, Ry is replaced with 


Rr y F(kr, Lr) — Rr y F (ky, Ry-1) = Rr 9 (Rr y L,-1) = Teas 


It is of course possible to design iterative ciphers that are not Feistel ciphers, yet whose encryption 
and decryption (after a certain re-ordering or re-calculation of variables) are structurally the same. 
Some examples are IDEA (see Question B.6.7) and several of the candidates for the AES (see 


Section B.3). 


2.1.4.2 Whatis Electronic Code Book Mode? 


In ECB mode (see Figure 2.2), each plaintext block is encrypted independently with the block cipher. 
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Figure 2.2: Electronic Code Book Mode. 


ECB mode is as secure as the underlying block cipher. However, plaintext patterns are not concealed. 
Each identical block of plaintext gives an identical block of ciphertext. The plaintext can be easily 
manipulated by removing, repeating, or interchanging blocks. The speed of each encryption operation 
is identical to that of the block cipher. ECB allows easy parallelization to yield higher performance. 
Unfortunately, no processing is possible before a block is seen (except for key setup). 
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2.1.4.3 Whatis Cipher Block Chaining Mode? 


In CBC mode (see Figure 2.3), each plaintext block is XORed with the previous ciphertext block and 
then encrypted. An initialization vector co is used as a “seed” for the process. 
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Figure 2.3: Cipher Block Chaining Encryption Mode. 


CBC mode is as secure as the underlying block cipher against standard attacks. In addition, any 
patterns in the plaintext are concealed by the XORing of the previous ciphertext block with the 
plaintext block. Note also that the plaintext cannot be directly manipulated except by removal of 
blocks from the beginning or the end of the ciphertext. The initialization vector should be different 
for any two messages encrypted with the same key and is preferably randomly chosen. It does not 
have to be encrypted and it can be transmitted with (or considered as the first part of) the ciphertext. 
However, consider the vulnerability described in Question 2.1.4.5} 


The speed of encryption is identical to that of the block cipher, but the encryption process cannot 
be easily parallelized, although the decryption process can be. 


PCBC mode is a variation on the CBC mode of operation and is designed to extend or propagate a 
single bit error in the ciphertext. This allows errors in transmission to be captured and the resultant 
plaintext to be rejected. The method of encryption is given by 


ci = Ex(ci-1 O mi-1 O mi) 
and decryption is achieved by computing 
Mi = C¡-1 O M¡-1 0 Dy.(c;). (2.1) 


There is a flaw in PCBC [Koh90], which may serve as an instructive example on cryptanalysis (see 
Section 2.4) of block ciphers. If two ciphertext blocks c;_2 and c;_1 are swapped, then the result of 
the ith step in the decryption still yields the correct plaintext block. More precisely, by (2.1) we have 


mi = Dy(c;) O (ci-1 O De (Ci-1)) O (c;-2 O Du (Ci_-2)) O C¡-3 O Mi-3- 


As a consequence, swapping two consecutive ciphertext blocks (or, more general, scrambling k 
consecutive ciphertext blocks) does not affect anything but the decryption of the corresponding 
plaintext blocks. Though the practical consequences of this flaw are not obvious, PCBC was replaced 
by CBC mode in Kerberos version 5. In fact, the mode has not been formally published as a federal 
or national standard. 
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2.1.4.4 Whatis Cipher Feedback Mode? 


In CFB mode (see Figure 2.4), the previous ciphertext block is encrypted and the output produced is 
combined with the plaintext block using XOR to produce the current ciphertext block. It is possible 
to define CFB mode so it uses feedback that is less than one full data block. An initialization vector 
Co is used as a “seed” for the process. 
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Figure 2.4: Cipher Feedback Mode. 


CFB mode is as secure as the underlying cipher and plaintext patterns are concealed in the ciphertext 
by the use of the XOR operation. Plaintext cannot be manipulated directly except by the removal 
of blocks from the beginning or the end of the ciphertext; see next question for some additional 
comments. With CFB mode and full feedback, when two ciphertext blocks are identical, the outputs 
from the block cipher operation at the next step are also identical. This allows information about 
plaintext blocks to leak. The security considerations for the initialization vector are the same as in 
CBC mode, except that the attack described in Question 2.1.4.5] is not applicable. Instead, the last 
ciphertext block can be attacked. 


When using full feedback, the speed of encryption is identical to that of the block cipher, but the 
encryption process cannot be easily parallelized. 


2.1.4.5 Whatis Output Feedback Mode? 


OFB mode (see Figure [2.5) is similar to CFB mode except that the quantity XORed with each 
plaintext block is generated independently of both the plaintext and ciphertext. An initialization 
vector sy is used as a “seed” for a sequence of data blocks s;, and each data block s; is derived from 
the encryption of the previous data block s;_;. The encryption of a plaintext block is derived by 
taking the XOR of the plaintext block with the relevant data block. 
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Figure 2.5: Output Feedback Mode. 


Feedback widths less than a full block are not recommended for security [DP83] [Jues3]. OFB mode 
has an advantage over CFB mode in that any bit errors that might occur during transmission are 
not propagated to affect the decryption of subsequent blocks. The security considerations for the 
initialization vector are the same as in CFB mode. 


A problem with OFB mode is that the plaintext is easily manipulated. Namely, an attacker who 
knows a plaintext block m; may replace it with a false plaintext block x by XORing m; © x to the 
corresponding ciphertext block c;. There are similar attacks on CBC and CFB modes, but in those 
attacks some plaintext block will be modified in a manner unpredictable by the attacker. Yet, the 
very first ciphertext block (that is, the initialization vector) in CBC mode and the very last ciphertext 
block in CFB mode are just as vulnerable to the attack as the blocks in OFB mode. Attacks of this 
kind can be prevented using for example a digital signature scheme (see Question ot a MAC 


scheme (see Question 2.1.7). 


The speed of encryption is identical to that of the block cipher. Even though the process cannot 
easily be parallelized, time can be saved by generating the keystream before the data is available for 
encryption. 


Due to shortcomings in OFB mode, Diffie has proposed [Bra88] an additional mode of operation, 
termed the counter mode. It differs from OFB mode in the way the successive data blocks are 
generated for subsequent encryptions. Instead of deriving one data block as the encryption of the 
previous data block, Diffie proposed encrypting the quantity i + IV mod 2 for the ith data block, 
where IV is some initialization vector. 
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2.1.5 Whatis a stream cipher? 


A stream cipher is a type of symmetric encryption algorithm. Stream ciphers can be designed to 
be exceptionally fast, much faster than any block cipher (see Question 2.1.4). While block ciphers 
operate on large blocks of data, stream ciphers typically operate on smaller units of plaintext, usually 
bits. The encryption of any particular plaintext with a block cipher will result in the same ciphertext 
when the same key is used. With a stream cipher, the transformation of these smaller plaintext units 
will vary, depending on when they are encountered during the encryption process. 


A stream cipher generates what is called a Reystream (a sequence of bits used as a key). Encryption is 
accomplished by combining the keystream with the plaintext, usually with the bitwise XOR operation. 
The generation of the keystream can be independent of the plaintext and ciphertext, yielding what is 
termed a synchronous stream cipher, or it can depend on the data and its encryption, in which case the 
stream cipher is said to be se/-synchronizing. Most stream cipher designs are for synchronous stream 
ciphers. 

Current interest in stream ciphers is most commonly attributed to the appealing theoretical 
properties of the one-time pad. A one-time pad, sometimes called the Vernam cipher (Wer20], uses a 
string of bits that is generated completely at random. The keystream is the same length as the plaintext 
message and the random string is combined using bitwise XOR with the plaintext to produce the 
ciphertext. Since the entire keystream is random, even an opponent with infinite computational 
resources can only guess the plaintext if he or she sees the ciphertext. Such a cipher is said to offer 
perfect secrecy, and the analysis of the one-time pad is seen as one of the cornerstones of modern 
cryptography [Sha49]. While the one-time pad saw use during wartime over diplomatic channels 
requiring exceptionally high security, the fact that the secret key (which can be used only once) is as 
long as the message introduces severe key management problems (see Section 4.1). While perfectly 
secure, the one-time pad is in general impractical. 


Stream ciphers were developed as an approximation to the action of the one-time pad. While 
contemporary stream ciphers are unable to provide the satisfying theoretical security of the one-time 
pad, they are at least practical. 


As of now there is no stream cipher that has emerged as a de facto standard. The most widely used 
stream cipher is RC4 (see Question B.6.3). Interestingly, certain modes of operation of a block cipher 
effectively transform it into a keystream generator and in this way, any block cipher can be used as 
a stream cipher; as in DES in CFB or OFB modes (see Question 2-1-4] and Section B.2). However, 
stream ciphers with a dedicated design ate typically much faster. 


More information about stream ciphers and the various available algorithms can be found in 
almost any book on contemporary cryptography and in RSA Laboratories Technical Report TR-701 


[Rob95a]. 


2.1.5.1 Whatis a Linear Feedback Shift Register? 


A Linear Feedback Shift Register (LFSR) is a mechanism for generating a sequence of binary bits. The 
register (see Figure 2.0) consists of a series of cells that are set by an initialization vector that is, most 
often, the secret key. The behavior of the register is regulated by a counter (in hardware this counter 
is often referred to as a “clock”). At each instant, the contents of the cells of the register are shifted 
right by one position, and the XOR of a subset of the cell contents is placed in the leftmost cell. One 
bit of output is usually derived during this update procedure. 
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Figure 2.6: Linear Feedback Shift Register (LFSR). 


LFSRs are fast and easy to implement in both hardware and software. With a judicious choice of 
feedback taps (the particular bits that are used, in Figure 2.0, the first and fifth bits are “tapped”) 
the sequences that ate generated can have a good statistical appearance. However, the sequences 
generated by a single LFSR are not secure because a powerful mathematical framework has been 
developed over the years which allows for their straightforward analysis. However, LFSRs are useful 
as building blocks in more secure systems. 


A shift register cascade is a set of LFSRs connected together in such a way that the behavior of one 
particular LFSR depends on the behavior of the previous LFSRs in the cascade. This dependent 
behavior is usually achieved by using one LFSR to control the counter of the following LFSR. 
For instance, one register might be advanced by one step if the preceding register output is 1 and 
advanced by two steps otherwise. Many different configurations are possible and certain parameter 


choices appear to offer very good security. For more detail, see an excellent survey article by Gollman 


and Chambers [GC89]. 


The shrinking generator was developed by Coppersmith, Krawczyk, and Mansour [CKM94]. It is a 
stream cipher based on the simple interaction between the outputs from two LFSRs. The bits of one 
output are used to determine whether the corresponding bits of the second output will be used as 
part of the overall keystream. The shrinking generator is simple and scaleable, and has good security 
properties. One drawback of the shrinking generator is that the output rate of the keystream will 
not be constant unless precautions are taken. A variant of the shrinking generator is the se/-shrinking 
generator [MSO5b], where instead of using one output from one LFSR to “shrink” the output of 
another (as in the shrinking generator), the output of a single LFSR is used to extract bits from the 
same output. There are as yet no results on the cryptanalysis of either technique. 
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2.1.6 Whatis a hash function? 


A hash function H is a transformation that takes an input m and returns a fixed-size string, which is 
called the hash value h (that is, h = H(m)). Hash functions with just this property have a variety 
of general computational uses, but when employed in cryptography, the hash functions are usually 
chosen to have some additional properties. 


The basic requirements for a cryptographic hash function are as follows. 
e The input can be of any length. 
e The output has a fixed length. 
e H(z) is relatively easy to compute for any given zx. 
e H(x) is one-way. 
e H(z) is collision-free. 


A hash function HH is said to be one-way if it is hard to invert, where “hard to invert” means that given 
a hash value A, it is computationally infeasible to find some input x such that H(z) = h. If, given a 
message x, it is computationally infeasible to find a message y not equal to x such that H(x) = H(y), 
then H is said to be a weakly collision-free hash function. A strongly collision-free hash function H is one 
for which it is computationally infeasible to find any two messages x and y such that H(z) = H(y). 


For more information and a particularly thorough study of hash functions, see Preneel [Pre3}. 


The hash value represents concisely the longer message or document from which it was computed; 
this value is called the message digest. One can think of a message digest as a “digital fingerprint” of the 
larger document. Examples of well known hash functions are MD2 and MD5 (see Question B.6.0) 
and SHA (see Question B.6.5). 


Perhaps the main role of a cryptographic hash function is in the provision of message integrity checks 
and digital signatures. Since hash functions are generally faster than encryption or digital signature 
algorithms, it is typical to compute the digital signature or integrity check to some document by 
applying cryptographic processing to the document’s hash value, which is small compared to the 
document itself. Additionally, a digest can be made public without revealing the contents of the 
document from which it is derived. This is important in digital timestamping (see Question [7.11) 
where, using hash functions, one can get a document timestamped without revealing its contents to 
the timestamping service. 


Damgard and Merkle [DamY90] [Mer904] greatly influenced cryptographic hash function design by 
defining a hash function in terms of what is called a compression function. A compression function 
takes a fixed-length input and returns a shorter, fixed-length output. Given a compression function, 
a hash function can be defined by repeated applications of the compression function until the entire 
message has been processed. In this process, a message of arbitrary length is broken into blocks 
whose length depends on the compression function, and “padded” (for security reasons) so the size 
of the message is a multiple of the block size. The blocks are then processed sequentially, taking as 


Last 
Message 
Block 


Message Message 
Block 1 Block 2 


Initial 
Value 


Figure 2.7: Damgard/ Merkle iterative structure for hash functions; F is a compression function. 


input the result of the hash so far and the current message block, with the final output being the 
hash value for the message (see Figure 2.7). 


y 
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2.1.7 What are Message Authentication Codes? 


A message authentication code (MAC) is an authentication tag (also called a checksum) derived by 


applying an authentication scheme, together with a secret key, to a message. Unlike digital signatures, 


MACs ate computed and verified with the same key, so that they can only be verified by the intended 


recipient. There are four types of MACs: (1) unconditionally secure, (2) hash function-based, (3) 


stream cipher-based, or (4) block cipher-based. 


(1) 


(2 


NS 


3 


we 


(4) 


Simmons and Stinson [St5] proposed an unconditionally secure MAC based on encryption with 
a one-time pad. The ciphertext of the message authenticates itself, as nobody else has access to 
the one-time pad. However, there has to be some redundancy in the message. An unconditionally 
secure MAC can also be obtained by use of a one-time secret key. 


Hash function-based MACs (often called HMACs) use a key or keys in conjunction with a hash 
function (see Question 2.1.0) to produce a checksum that is appended to the message. An 
example is the keyed-MD5 (see Question B.6.6) method of message authentication [KKR95B]. 


Lai, Rueppel, and Woolven [LRW92] proposed a MAC based on stream ciphers (see Question 
2.1.5). In their algorithm, a provably secure stream cipher is used to split a message into two 
substreams and each substream is fed into a LFSR; the checksum is the final state of the two 
LFSRs. 


MACs can also be derived from block ciphers (see Question 2.1.4). The DES-CBC MAC is a 
widely used U.S. and international standard [INIS85]. The basic idea is to encrypt the message 
blocks using DES CBC and output the final block in the ciphertext as the checksum. Bellare et 
al. give an analysis of the security of this MAC [BKR092]. 


2.1.8 What are interactive proofs and zero-knowledge proofs? 


Informally, an interactive proof is a protocol between two parties in which one party, called the prover, 
tries to prove a certain fact to the other party, called the verifier. An interactive proof usually takes the 
form of a challenge-response protocol, in which the prover and the verifier exchange messages and 
the verifier outputs either “accept” or “reject” at the end of the protocol. Apart from their theoretical 
interest, interactive proofs have found applications in cryptography and computer security such as 
identification and authentication. In these situations, the fact to be proved is usually related to the 
prover's identity, such as the prover's private key. 


It is useful for interactive proofs to have the following properties, especially in cryptographic 
applications: 


e Completeness. The verifier always accepts the proof if the fact is true and both the prover 
and the verifier follow the protocol. 


e Soundness. The verifier always rejects the proof if the fact is false, as long as the verifier 
follows the protocol. 


e Zero knowledge. The verifier learns nothing about the fact being proved (except that it is 
correct) from the prover that he could not already learn without the prover, even if the verifier 
does not follow the protocol (as long as the prover does). In a zero-knowledge proof, the 
verifier cannot even later prove the fact to anyone else. (Not all interactive proofs have this 


property.) 


A typical round in a zero-knowledge proof consists of a “commitment” message from the prover, 
followed by a challenge from the verifier, and then a response to the challenge from the prover. The 
protocol may be repeated for many rounds. Based on the prover's responses in all the rounds, the 
verifier decides whether to accept or reject the proof. 


Figure 2.8: Ad Baba's Cave. 


Let us consider an intuitive example called Ali Baba’s Cave [QG90] (see Figure 2.8). Alice wants to 
prove to Bob that she knows the secret words that will open the portal at R — S in the cave, but she 
does not wish to reveal the secret to Bob. In this scenario, Alice’s commitment is to go to Ror S.A 


IF 
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typical round in the proof proceeds as follows: Bob goes to P and waits there while Alice goes to R 
ot S. Bob then goes to Q and shouts to ask Alice to appear from either the right side or the left side 
of the tunnel. If Alice does not know the secret words (for example, “Open Sesame”), there is only a 
50 percent chance she will come out from the right tunnel. Bob will repeat this round as many times 
as he desires until he is certain Alice knows the secret words. No matter how many times the proof 
repeats, Bob does not learn the secret words. 


There are a number of zero-knowledge and interactive proof protocols in use today as identification 
schemes. The Fiat-Shamir protocol [ES87] is the first practical zero-knowledge protocol with 
ctyptographic applications and is based on the difficulty of factoring. A more common variation of 
the Fiat-Shamir protocol is the Feige-Fiat-Shamir scheme [FFES88]. Guillou and Quisquater 
further improved Fiat-Shamir’s protocol in terms of memory requirements and interaction (the 
number of rounds in the protocol). 


Identification schemes based on interactive proofs can usually be transformed into digital signature 


schemes (see Question and [ES87)). 


2.1.9 What are secret sharing schemes? 


Secret sharing schemes were discovered independently by Blakley [Bla79] and Shamir [Sha79]. The 
motivation for secret sharing is secure key management. In some situations, there is usually one 
secret key that provides access to many important files. If such a key is lost (for example, the person 
who knows the key becomes unavailable, or the computer which stores the key is destroyed), then 
all the important files become inaccessible. The basic idea in secret sharing is to divide the secret key 
into pieces and distribute the pieces to different persons in a group so that certain subsets of the 
group can get together to recover the key. 


As a very simple example, consider the following scheme that includes a group of n people. Each 
person is given a share s;, which is a random bit string of a fixed specified length. The secret is the 
bit string 

S= $10 82 P: O Sn. 


Note that all shares are needed to recover the secret. 


A general secret sharing scheme specifies the minimal sets of users who are able to recover the secret 
by sharing their secret information. A common example of secret sharing is an m-out-of-n scheme 
(or (m, n)-threshold scheme) for integers 1 < m < n. In such a scheme, there is a sender (or dealer) 
and n participants. The sender divides the secret into n parts and gives each participant one part so 
that any m parts can be put together to recover the secret, but any m — 1 parts do not suffice to 
determine the secret. The pieces are usually called shares or shadows. Different choices for the values 
of m and n reflect the tradeoff between security and reliability. An m-out-of-n secret sharing scheme 
is perfect if any group of at most m — 1 participants (insiders) cannot determine more information 
about the secret than an outsider. The simple example above is a perfect n-out-of-n secret sharing 
scheme. 


Both Shamir's scheme and Blakley’s scheme (see Question are m-out-of-n secret sharing 
schemes, and Shamir's scheme is perfect in the sense just described. They represent two different 
ways of constructing such schemes, based on which more advanced secret sharing schemes can be 
designed. The study of the combination of proactive techniques (see Question [7.16) with secret 
sharing schemes is an active area of research. For further information on secret sharing schemes, see 


[Sim]. 
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2.2 SIMPLE APPLICATIONS OF CRYPTOGRAPHY 
2.2.1 Whatis privacy? 


Privacy is perhaps the most obvious application of cryptography. Cryptography can be used to 
implement privacy simply by encrypting the information intended to remain private. In order for 
someone to read this private data, one must first decrypt it. Note that sometimes information is 
not supposed to be accessed by anyone, and in these cases, the information may be stored in such 
a way that reversing the process is virtually impossible. For instance, on a typical multi-user system, 
no one is supposed to know the list of passwords of everyone on the system. Often hash values of 
passwords are stored instead of the passwords themselves. This allows the users of the system to be 
confident their private information is actually kept private while still enabling an entered password 
to be verified (by computing its hash and comparing that result against a stored hash value). 


2.2.2 Whatis a digital signature and what is authentication? 


Authentication is any process through which one proves and verifies certain information. Sometimes 
one may want to verify the origin of a document, the identity of the sender, the time and date a 
document was sent and/or signed, the identity of a computer or user, and so on. A digital signature 
is a cryptographic means through which many of these may be verified. The digital signature of a 
document is a piece of information based on both the document and the signet’s private key. It 
is typically created through the use of a hash function (see Question 2.1.0) and a private signing 
function (encrypting with the signer's private key), but there are other methods. 


Every day, people sign their names to letters, credit card receipts, and other documents, demonstrating 
they are in agreement with the contents. That is, they authenticate that they are in fact the sender 
or originator of the item. This allows others to verify that a particular message did indeed originate 
from the signer. However, this is not foolproof, since people can ‘lift’ signatures off one document 
and place them on another, thereby creating fraudulent documents. Written signatures are also 
vulnerable to forgery because it is possible to reproduce a signature on other documents as well as 
to alter documents after they have been signed. 


Digital signatures and hand-written signatures both rely on the fact that it is very hard to find two 
people with the same signature. People use public-key cryptography to compute digital signatures by 
associating something unique with each person. When public-key cryptography is used to encrypt 
a message, the sender encrypts the message with the public key of the intended recipient. When 
public-key cryptography is used to calculate a digital signature, the sender encrypts the “digital 
fingerprint” of the document with his or her own private key. Anyone with access to the public key 
of the signer may verify the signature. 


Suppose Alice wants to send a signed document or message to Bob. The first step is generally to 
apply a hash function to the message, creating what is called a message digest. The message digest is 
usually considerably shorter than the original message. In fact, the job of the hash function is to take 
a message of arbitrary length and shrink it down to a fixed length. To create a digital signature, one 
usually signs (encrypts) the message digest as opposed to the message itself. This saves a considerable 
amount of time, though it does create a slight insecurity (addressed below). Alice sends Bob the 
encrypted message digest and the message, which she may or may not encrypt. In order for Bob to 
authenticate the signature he must apply the same hash function as Alice to the message she sent 
him, decrypt the encrypted message digest using Alice’s public key and compare the two. If the two 
are the same he has successfully authenticated the signature. If the two do not match there are a few 
possible explanations. Either someone is trying to impersonate Alice, the message itself has been 
altered since Alice signed it or an error occurred during transmission. 


There is a potential problem with this type of digital signature. Alice not only signed the message she 
intended to but also signed all other messages that happen to hash to the same message digest. When 
two messages hash to the same message digest it is called a collision; the collision-free properties of 
hash functions (see Question 2.1.0) are a necessary security requirement for most digital signature 
schemes. A hash function is secure if it is very time consuming, if at all possible, to figure out the 
original message given its digest. However, there is an attack called the birthday attack that relies on 
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the fact that it is easier to find two messages that hash to the same value than to find a message that 
hashes to a particular value. Its name arises from the fact that for a group of 23 or more people the 
probability that two or more people share the same birthday is better than 50%. How the birthday 
paradox can be applied to cryptanalysis is described in the answer to Question 2.4.4, 


In addition, someone could pretend to be Alice and sign documents with a key pair he claims is 
Alice’s. To avoid scenarios such as this, there are digital documents called certificates that associate a 
person with a specific public key. For more information on certificates, see Question f.1.3.10, 


Digital timestamps may be used in connection with digital signatures to bind a document to a 
particular time of origin. It is not sufficient to just note the date in the message, since dates on 
computers can be easily manipulated. It is better that timestamping is done by someone everyone 
trusts, such as a certifying authority (see Question f.1.3.12). There have been proposals suggesting 
the inclusion of some unpredictable information in the message such as the exact closing share price 
of a number of stocks; this information should prove that the message was created afer a certain 
point in time. 


2.2.3 Whatis a key agreement protocol? 


A key agreement protocol, also called a key exchange protocol, is a series of steps used when two or 
more parties need to agree upon a key to use for a secret-key cryptosystem. These protocols 
allow people to share keys freely and securely over any insecure medium, without the need for a 
previously-established shared secret. 


Suppose Alice and Bob want to use a secret-key cryptosystem (see Question to communicate 
securely. They first must decide on a shared key. Instead of Bob calling Alice on the phone 
and discussing what the key will be, which would leave them vulnerable to an eavesdropper, they 
decide to use a key agreement protocol. By using a key agreement protocol, Alice and Bob may 
securely exchange a key in an insecure environment. One example of such a protocol is called the 
Diffie-Hellman key agreement (see Question B.6.1). In many cases, public-key cryptography is used 
in a key agreement protocol. Another example is the use of digital envelopes (see Question 2.2.4) for 
key agreement. 
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2.2.4 What is a digital envelope? 


When using secret-key cryptosystems, users must first agree on a session key, that is, a secret key to 
be used for the duration of one message or communication session. In completing this task there is 
a tisk the key will be intercepted during transmission. This is part of the key management problem 
(see Section H.T). Public-key cryptography offers an attractive solution to this problem within a 
framework called a digital envelope. 


The digital envelope consists of a message encrypted using secret-key cryptography and an encrypted 
secret key. While digital envelopes usually use public-key cryptography to encrypt the secret key, this 
is not necessary. If Alice and Bob have an established secret key, they could use this to encrypt the 
secret key in the digital envelope. 


Suppose Alice wants to send a message to Bob using secret-key cryptography for message encryption 
and public-key cryptography to transfer the message encryption key. Alice chooses a secret key and 
encrypts the message with it, then encrypts the secret key using Bob’s public key. She sends Bob 
both the encrypted secret key and the encrypted message. When Bob wants to read the message he 
decrypts the secret key, using his private key, and then decrypts the message, using the secret key. 
In a multi-addressed communications environment such as e-mail, this can be extended directly and 
usefully. If Alice’s message is intended for both Bob and Carol, the message encryption key can be 
represented concisely in encrypted forms for Bob and for Carol, along with a single copy of the 
message’s content encrypted under that message encryption key. 


Alice and Bob may use this key to encrypt just one message or they may use it for an extended 
communication. One of the nice features about this technique is they may switch secret keys as 
frequently as they would like. Switching keys often is beneficial because it is more difficult for an 
adversary to find a key that is only used for a short period of time (see Question 4.1.2.3] for more 
information on the life cycle of a key). 


Not only do digital envelopes help solve the key management problem, they increase performance 
(relative to using a public-key system for direct encryption of message data) without sacrificing 
security. The increase in performance is obtained by using a secret-key cryptosystem to encrypt the 
large and variably sized amount of message data, reserving public-key cryptography for encryption of 
short-length keys. In general, secret-key cryptosystems are much faster than public-key cryptosystems. 


The digital envelope technique is a method of key exchange, but not all key exchange protocols use 
digital envelopes (see Question 2.2.3). 


2.2.5 Whatis identification? 


Identification is a process through which one ascertains the identity of another person or entity. In our 
daily lives, we identify our family members, friends, and coworkers by their physical properties, such 
as voice, face or other characteristics. These characteristics, called biometrics (see Question [7.20), 
can only be used on computer networks with special hardware. Entities on a network may also 
identify one another using cryptographic methods. 


An identification scheme allows Alice to identify herself to Bob in such a way that someone listening 
in cannot pose as Alice later. One example of an identification scheme is a zero-knowledge proof (see 
Question P.T.8). Zero knowledge proofs allow a person (or a server, web site, etc.) to demonstrate 
they have a certain piece information without giving it away to the person (or entity) they are 
convincing. Suppose Alice knows how to solve the Rubik’s cube and wants to convince Bob she can 
without giving away the solution. They could proceed as follows. Alice gives Bob a Rubik’s cube 
which he thoroughly messes up and then gives back to Alice. Alice turns away from Bob, solves the 
puzzle and hands it back to Bob. This works because Bob saw that Alice solved the puzzle, but he 
did not see the solution. 


This idea may be adapted to an identification scheme if each person involved is given a “puzzle” 
and its answer. The security of the system relies on the difficulty of solving the puzzles. In the case 
above, if Alice were the only person who could solve a Rubik’s cube, then that could be her puzzle. 
In this scenario Bob is the verifier and is identifying Alice, the prover. 


The idea is to associate with each person something unique; something only that person can 
reproduce. This in effect takes the place of a face or a voice, which are unique factors allowing 
people to identify one another in the physical world. 


Authentication and identification are different. Identification requires that the verifier check the 
information presented against all the entities it knows about, while authentication requires that the 
information be checked for a single, previously identified, entity. In addition, while identification 
must, by definition, uniquely identify a given entity, authentication does not necessarily require 
uniqueness. For instance, someone logging into a shared account is not uniquely identified, but by 
knowing the shared password, they are authenticated as one of the users of the account. Furthermore, 
identification does not necessarily authenticate the user for a particular purpose. 
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2.3 HARD PROBLEMS 
2.3.1 Whatis a hard problem? 


Public-key cryptosystems (see Question 2.1.1) are based on a problem that is in some sense difficult 
to solve. Difficult in this case refers more to the computational requirements in finding a solution 
than the conception of the problem. These problems are called hard problems. Some of the most 
well known examples are factoring, theorem-proving, and the Traveling Salesman Problem (see 


Question 2.3.12). 


There are two major classes of problems that interest cryptographers -- P and NP. Put simply, a 
problem is in P if it can be solved in polynomial time (see Section [A.7), while a problem is in NP if 
the validity of a proposed solution can be checked in polynomial time. An alternative definition of 
NP is found in the glossary in the end of this document. Every problem in P is in NP, but we do not 
know whether P = NP or not. 


For example, the problem of multiplying two numbers is in P. Namely, the number of bit operations 
required to multiply two numbers of bit length k is at most k?, a polynomial. The problem of finding 
a factor of a number is in NP, because a proposed solution can be checked in polynomial time. 
However, it is not known whether this problem is in P. 


The question of whether or not P = NP is one of the most important unsolved problems in all of 
mathematics and computer science. So far, there has been very little progress towards its solution. 
One thing we do have is the concept of an NP-complete problem. A problem X in NP is called 
NP-complete if any other NP problem can be reduced (transformed) into X in polynomial time. If 
some NP-complete problem X can be solved in polynomial time, then every NP problem Y can be 
solved in polynomial time; first reduce Y to X and then solve X. The Traveling Salesman Problem, 
the Knapsack problem, and the Hamiltonian Problem (see Question are a few NP-complete 
problems. 


To prove that P = NP, it would suffice to find a polynomial-time algorithm for one of the 
NP-complete problems. However, it is commonly thought that P 4 NP. If it were to be proved 
that P = NP, we could potentially solve an enormous variety of complex problems quickly without a 
significant advance in computing technology (assuming the reductions between different problems 
are efficient in practice). For more on the theory of computation, we recommend and [LP98]. 


2.3.2 Whatis a one-way function? 


A one-way function is a mathematical function that is significantly easier to compute in one direction 
(the forward direction) than in the opposite direction (the inverse direction). It might be possible, 
for example, to compute the function in the forward direction in seconds but to compute its inverse 
could take months or years, if at all possible. A trapdoor one-way function is a one-way function for 
which the inverse direction is easy given a certain piece of information (the trapdoor), but difficult 
otherwise. 


Public-key cryptosystems are based on (presumed) trapdoor one-way functions. The public key gives 
information about the particular instance of the function; the private key gives information about 
the trapdoor. Whoever knows the trapdoor can compute the function easily in both directions, but 
anyone lacking the trapdoor can only perform the function easily in the forward direction. The 
forward direction is used for encryption and signature verification; the inverse direction is used for 
decryption and signature generation. 


In almost all public-key systems, the size of the key corresponds to the size of the inputs to the 
one-way function; the larger the key, the greater the difference between the efforts necessary to 
compute the function in the forward and inverse directions (for someone lacking the trapdoor). For 
a digital signature to be secure for years, for example, it is necessary to use a trapdoor one-way 
function with inputs large enough that someone without the trapdoor would need many years to 
compute the inverse function (that is, to generate a legitimate signature). 


All practical public-key cryptosystems are based on functions that are believed to be one-way, but 
no function has been proven to be so. This means it is theoretically possible to discover algorithms 
that can compute the inverse direction easily without a trapdoor for some of the one-way functions; 
this development would render any cryptosystem based on these one-way functions insecure and 
useless. On the other hand, further research in theoretical computer science may result in concrete 
lower bounds on the difficulty of inverting certain functions; this would be a landmark event with 
significant positive ramifications for cryptography. 
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2.3.3 What is the factoring problem? 


Factoring is the act of splitting an integer into a set of smaller integers (factors) which, when 
multiplied together, form the original integer. For example, the factors of 15 are 3 and 5; the 
factoring problem is to find 3 and 5 when given 15. Prime factorization requires splitting an integer 
into factors that are prime numbers; every integer has a unique prime factorization. Multiplying two 
ptime integers together is easy, but as far as we know, factoring the product of two (or more) prime 
numbers is much mote difficult. 


Factoring is the underlying, presumably hard problem upon which several public-key cryptosystems 
are based, including the RSA algorithm. Factoring an RSA modulus (see Question B.T.T) would 
allow an attacker to figure out the private key; thus, anyone who can factor the modulus can decrypt 
messages and forge signatures. The security of the RSA algorithm depends on the factoring problem 
being difficult and the presence of no other types of attack. There has been some recent evidence 
that breaking the RSA cryptosystem is not equivalent to factoring [BV98]. It has not been proven 
that factoring must be difficult, and there remains a possibility that a quick and easy factoring method 
might be discovered (see Question 2.3.5), though factoring researchers consider this possibility 
remote. 


It is not necessarily true that a large number is more difficult to factor than a smaller number. For 
example, the number 101°” is easy to factor, while the 155-digit number RSA-155 (see Question 2.3.6) 
was factored after seven months of extensive computations. What is true in general is that a number 
with large prime factors is more difficult to factor than a number with small prime factors (still, the 
running time of some factoring algorithms depends on the size of the number only and not on the 
size of its prime factors). This is why the size of the modulus in the RSA algorithm determines how 
secure an actual use of the RSA cryptosystem is. Namely, an RSA modulus is the product of two 
large primes; with a larger modulus, the primes become larger and hence an attacker needs more time 
to factor it. Yet, remember that a number with large prime factors might possess certain properties 
making it easy to factor. For example, this is the case if the prime factors are very close to each other 


(see Question B.T.5). 


2.3.4 What are the best factoring methods in use today? 


Factoring is a very active field of research among mathematicians and computer scientists; the 
best factoring algorithms are mentioned below with some references and their big-O asymptotic 
efficiencies; O-notation refers to the upper bound on the asymptotic running time of an algorithm 
[CLR90]; a brief description is given in Section [A.7, For textbook treatment of factoring algorithms, 


see [Knu8T] [Kob94] [LL90] [Bre89}. 


Factoring algorithms come in two flavors, special purpose and general purpose; the efficiency of the 
former depends on the unknown factors, whereas the efficiency of the latter depends on the number 
to be factored. Special-purpose algorithms are best for factoring numbers with small factors, but the 
numbers used for the modulus in the RSA cryptosystem do not have any small factors. Therefore, 
general-purpose factoring algorithms are the more important ones in the context of cryptographic 
systems and their security. 


Special-purpose factoring algorithms include the Pollard rho method [Pol75], with expected running 
time O(,/p), and the Pollard p — 1 method [Pol74], with running time O(p’), where p’ is the largest 
prime factor of p— 1. The Pollard p+ 1 method is also a special purpose factoring algorithm, with 
running time O(p’), where p’ is the largest prime factor of p + 1. All of these take an amount of time 
that is exponential in the size (bit length) of p, the prime factor that they find; thus these algorithms 
are too slow for most factoring jobs. The eliptic curve method (ECM) [Len87] is superior to these; its 
asymptotic running time is O(ev 2™?)(""P)), The ECM is often used in practice to find factors of 
randomly generated numbers; it is not fast enough to factor a large modulus of the kind used in the 
RSA cryptosystem. 


The best general-purpose factoring algorithm today is the Number Field Sieve (NFS) [BLP94] [BLZ94], 
which runs in time approximately O(e!-9n ny (nn ne, Previously, the most widely used general- 


purpose algorithm was the Multiple Polynomial Quadratic Sieve (MPOS) [SIST], which has running time 
O(elun) An my 


Recent improvements to the Number Field Sieve make the NFS more efficient than MPQS in 
factoring numbers larger than about 115 digits [DL95], while MPQS is better for small integers. 
While RSA-129 (see Question 2.3.0) was factored using a variation of MPQS, a variant of the NFS 
was used in the recent factoring of RSA-155 (a 155-digit number). It is now estimated that if the 
NES had been used for RSA-129, it would have taken one quarter of the time. Clearly, NFS has 
overtaken MPQS as the most widely used factoring algorithm. 


A “general number” is one with no special form that might make it easier to factor; moduli used 
in the RSA cryptosystem ate created to be general numbers. A number being of a “special form” 
means generally that there is an easy way of expressing it. For example, the number might be a 
Fermat number, which means that it is equal to 22” + 1 for some integer n. The Cunningham Project 
[BLS88] keeps track of the factorizations of numbers with special forms and maintains a “10 Most 
Wanted” list of desired factorizations. A good way to survey current factoring capability of “general 
numbers” is to look at recent results of the RSA Factoring Challenge (see Question 2.3.0). 
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2.3.5 What improvements are likely in factoring capability? 


Factoring (see Question 2.3.3) has become easier over the last 15 years for three reasons: computer 
hardware has become more powerful, computers have become mote plentiful and inexpensive, and 
better factoring algorithms have emerged. 


Hardware improvement will continue inexorably, but it is important to realize hardware improvements 
make the RSA cryptosystem more secure, not less. This is because a hardware improvement that 
allows an attacker to factor a number two digits longer than before will at the same time allow a 
legitimate RSA algorithm user to use a key dozens of digits longer than before. Therefore, although 
the hardware improvement does help the attacker, it helps the legitimate user much more. However, 
there is a danger that in the future factoring will take place using faster machines than are currently 
available, and these machines may be used to attack RSA cryptosystem keys generated in the past. In 
this scenario, the attacker alone benefits from the hardware improvement. This consideration argues 
for using a larger key size today than one might otherwise consider warranted. It also argues for 
replacing one’s key with a longer key every few years, in order to take advantage of the extra security 
offered by hardware improvements. This point holds for other public-key systems as well. 


Recently, the number of computers has increased dramatically. While the computers have become 
steadily more powerful, the increase in their power has not compared to their increase in number. 
Since some factoring algorithms can be done with multiple computers working together, the more 
computers devoted to a problem, the faster the problem can be solved. Unlike the hardware 
improvement factor, prevalence of computers does not make the RSA cryptosystem more secure. 


Better factoring algorithms have been more help to the attacker than have hardware improvements. 
As the RSA cryptosystem and cryptography in general have attracted much attention, so has the 
factoring problem, and many researchers have found new factoring methods or improved upon 
others. This has made factoring easier for numbers of any size, irrespective of the speed of the 
hardware. However, factoring is still a very difficult problem. 


Increasing the key size can offset any decrease in security due to algorithm improvements. In fact, 
between general computer hardware improvements and special-purpose hardware improvements, 
increases in key size (maintaining a constant speed of RSA algorithm operations) have kept pace or 
exceeded increases in algorithm efficiency, resulting in no net loss of security. As long as hardware 
continues to improve at a faster rate than the rate at which the complexity of factoring algorithms 
decreases, the security of the RSA cryptosystem will increase, assuming users regularly increase their 
key sizes by appropriate amounts. The open question is how much faster factoring algorithms can 
get; there could be some intrinsic limit to factoring speed, but this limit remains unknown. However, 
if an “easy” solution to the factoring problem can be found, the associated increase in key sizes will 
render the RSA system impractical. 


Factoring is widely believed to be a hard problem (see Question 2.3.1), but this has not yet been 
proven. Therefore, there remains a possibility that an easy factoring algorithm will be discovered. 
This development, which could seriously weaken the RSA cryptosystem, would be highly surprising 
and the possibility is considered remote by the researchers most active in factoring research. 


There is also the possibility someone will prove factoring is difficult. Such a development, while 
unexpected at the current state of theoretical factoring research, would guarantee the security of the 
RSA cryptosystem beyond a certain key size. 


Even if no breakthroughs are discovered in factoring algorithms, both factoring and discrete 
logarithm problems (see Question 2.3.7) can be solved efficiently on a quantum computer (see 
Question [7.T7) if one is ever developed. 
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2.3.6 Whatis the RSA Factoring Challenge? 


The RSA Factoring Challenge was started in March 1991 by RSA Data Security (now RSA Security) 
to keep abreast of the state of the art in factoring. Since its inception, well over a thousand numbers 
have been factored, with the factorers returning valuable information on the methods they used 
to complete the factorizations. The Factoring Challenge provides one of the largest test-beds for 
factoring implementations and provides one of the largest collections of factoring results from many 
different experts worldwide. In short, this vast pool of information gives us an excellent opportunity 
to compare the effectiveness of different factoring techniques as they are implemented and used in 
practice. Since the security of the RSA public-key cryptosystem relies on the inability to factor large 
numbers of a special type, the cryptographic significance of these results is self-evident. 


The most important result thus far is the factorization of RSA-155 (a number with 155 digits), which 
was completed in August 1999 after seven months. A group consisting of, among several others, 
Arjen K. Lenstra and Herman te Riele performed the necessary computations on 300 workstations 
and PCs. The factorization of this 512-bit number is crucial as 512 is the default key size used for the 
major part of the e-commerce on Internet. The result indicates that a well-organized group of users 
such as distributed.net (see Question 2.4.4) might be able to break a 512-bit key in just a couple of 
days. 


Yet, the practical significance of the factorization of RSA-155 should not be exaggerated. The result 
is very impressive, but the cost for breaking a 512-bit key is still high enough to prevent potential 
attackers to apply the techniques on a wider basis. Consider it as a reminder of the importance of 
choosing sufficiently large key sizes; choosing key sizes of at least 768 bits was recommended by 
RSA Laboratories long before this factorization. 


As a curiosity, we mention that the RSA-155 factorization is 


109417386415705274218097073220403576120037329454492059909138421314763499842889 
34784717997257891267332497625752899781833797076537 244027 146743531593354333897 


1026395928297411057720541965739916759007 16567808038066803341933521790711307779 
* 


106603488380168454820927220360012878679207958575989291522270608237 193062808643. 


For more information about the RSA Factoring Challenges, see 


http://www.rsasecurity.com/rsalabs/challenges/]. 


The challenge is administered by RSA Security with quarterly cash awards. Send e-mail to 


challenge-infoOrsasecurity.com] for more information. For an analysis of early results from the 
factoring challenge, see [ER95]. 


A predecessor to the RSA Factoring Challenge is RSA-129. This number is a 129-digit (426-bit) 
integer published in Martin Gardner's column in Scientific American in 1977; it is not part of the 


RSA Factoring Challenge. A prize of $100 was offered to anybody able to factor the number; it 
was factored in March 1994 by Atkins, Graff, Lenstra, and Leyland [[AGLO5] after eight months of 
extensive computations. 
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2.3.7 Whatis the discrete logarithm problem? 


The discrete logarithm problem applies to mathematical structures called groups; see Section [A.3] for the 
definition of a group. A group is a collection of elements together with a binary operation which 
we will refer to as group multiplication. For a group element g and a number n, let g” denote the 
element obtained by multiplying g by itself n times; g? = g * g, y? = g * g * g, and so on. The discrete 
logarithm problem is as follows: given an element g in a finite group G and another element h € G, 
find an integer x such that g” = h. For example, the solution to the problem 3” = 13 (mod 17) is 4, 
because 34 = 81 = 13 (mod 17). 


Like the factoring problem, the discrete logarithm problem is believed to be difficult and also to be 
the hard direction of a one-way function. For this reason, it has been the basis of several public-key 
ctyptosystems, including the ElGamal system and DSS (see Question and Section B.4). The 
discrete logarithm problem bears the same relation to these systems as factoring does to the RSA 
system: the security of these systems rests on the assumption that discrete logarithms are difficult to 
compute. Although the discrete logarithm problem exists in any group, when used for cryptographic 
purposes the group is usually Z% (see Section A.J. 


The discrete logarithm problem has received much attention in recent yeats; descriptions of some 
of the most efficient algorithms for discrete logarithms over finite fields can be found in [OdI84] 
LL90] [COS89] [Gor93] [GM93]. The best discrete logarithm algorithms have expected running 
times similar to those of the best factoring algorithms. Rivest [Riv92a] has analyzed the expected 
time to solve the discrete logarithm problem both in terms of computing power and cost. 


In general, the discrete logarithm in an arbitrary group of size n can be computed in running time 


O(yn) [Pol74], though in many groups it can be done faster. 


2.3.8 What are the best discrete logarithm methods in use today? 


Currently, the best algorithms to solve the discrete logarithm problem (see Question 2.3.7) are 
broken into two classes: ¿ndex-calculus methods and collision search methods. The two classes of algorithms 
differ in the ways they are applied. Index calculus methods generally require certain arithmetic 
properties to be present in order to be successful, whereas collision search algorithms can be applied 
much more generally. The absence of more properties in elliptic curve groups prevents the more 
powerful index-calculus techniques from being used to attack the elliptic curve analogues of the more 
traditional discrete logarithm based cryptosystems (see Section B.3). 


Index calculus methods are very similar to the fastest current methods for integer factoring and they 
run in what is termed sub-exponential time. They are not as fast as polynomial time algorithms, 
yet they are considerably faster than exponential time methods. There are two basic index calculus 
methods closely related to the quadratic sieve and number field sieve factoring algorithms (see 


Question 2.3.4). 
As of this time, the largest discrete logarithm problem that has been solved was over GF (2593), 


Collision search algorithms have purely exponential running time. The best general method is known 
as the Pollard rho method, so-called because the algorithm produces a trail of numbers that when 
eraphically represented with a line connecting successive elements of the trail looks like the Greek 
letter rho. There is a tail and a loop; the objective is basically to find where the tail meets the loop. 
This method runs in time O(yn) (more accurately, in \/7n/2 steps) where n is the size of the group. 
The largest such problem that has been publicly solved has n ~ 21% (see Question B.5.5). This is the 
best known method of attack for the general elliptic curve discrete logarithm problem. 
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2.3.9 What are the prospects for a theoretical breakthrough in the 


discrete logarithm problem? 


It is impossible to predict when a mathematical breakthrough might occur; this is why they are 
called breakthroughs. Factoring algorithms have been studied for hundreds of years, general discrete 
logarithm algorithms have been extensively studied since the early 1970s, and elliptic curve discrete 
logarithms have been studied since the mid-1980s. Each time a new algorithm has been announced 
it has come more ot less as a surprise to the research community. 


It should be noted that for integer factoring and general discrete logarithms, a “breakthrough” means 
finding a polynomial time algorithm. However, for elliptic curve discrete logarithms, a breakthrough 
may consist of just finding a sub-exponential time method. 


We mention that a solution to the discrete logarithm problem would be applicable to the factoring 
problem (see Question 2.3.3). 


2.3.10 What are elliptic curves? 


Elliptic curves are mathematical constructions from number theory and algebraic geometry, which 
in recent years have found numerous applications in cryptography. 


An elliptic curve can be defined over any field (for example, real, rational, complex), though elliptic 
curves used in cryptography are mainly defined over finite fields. An elliptic curve consists of all 
elements (x, y) satisfying the equation 


y =a +ar+b (2.2) 


together with a single element denoted O called the “point at infinity,” which can be visualized as the 
point at the top and bottom of every vertical line. The elliptic curve formula is slightly different for 
some fields. 


R Da = pi + pr 


Figure 2.9: Elliptic curve addition. 


The set of points on an elliptic curve forms a group under addition, where addition of two points on 
an elliptic curve is defined according to a set of simple rules. For example, consider the two points 
pı and pa in Figure 2.9, Point pı plus point pa is equal to point pa = (x, —y), where (x, y) = p3 is the 
third point on the intersection of the elliptic curve and the line L through pı and p2. The addition 
operation in an elliptic curve is the counterpart to modular multiplication in common public-key 
ctyptosystems, and multiple addition is the counterpart to modular exponentiation. Elliptic curves 
ate covered in more recent texts on cryptography, including an informative text by Koblitz [Kob94]. 
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2.3.11 What are lattice-based cryptosystems? 


Lattice-based cryptosystems are based on NP-complete problems (see Question 2.3.1) involving 
lattices. A lattice can be viewed as the set of all linear combinations with integral coefficients of 
a specified set of elements in a vector space. An example of a lattice is the infinite square grid in 
2-dimensional space consisting of all points with integral coordinates. This lattice is generated by 
integral linear combinations of the vectors (0, 1) and (1, 0). 


Lattice-based methods fall into two basic classes, although the solution methods for both are identical. 
In fact, there are efficient transformations between the two classes. The first class is based on the 
so-called subset sum problem: Given a set of numbers S = {a1, a2, . . . , a) and another number K, 
find a subset of S whose values sum to K. The knapsack problem of Merkle and Hellman [MH78] 


is an example of this. 


Other lattice-based methods require finding short vectors embedded in a lattice or finding points 
in the vector space close to vertices of the lattice or close to vectors embedded in the lattice. The 
method of Ajtai and Dwork [AD97] is an example of this type. 


So far lattice-based methods have not proven effective as a foundation for public-key methods. In 
otder for a lattice-based cryptosystem to be secure, the dimension of the underlying problem has to 
be large. This results in a large key size, rendering encryption and decryption quite slow. Ongoing 
research aims to improve the efficiency of these cryptosystems. 


2.3.12 Whatare some other hard problems? 


There are many other kinds of hard problems. The list of NP-complete problems (see Question 2.3.1) 
is extensive and growing. So far, none of these has been effectively applied towards producing a 
public-key cryptosystem. A few examples of hard problems are the Traveling Salesman Problem, the 
Integer and Mixed Integer Programming Problem, the Graph Coloring Problem, the Hamiltonian 
Path Problem and the Satisfiability Problem for Boolean Expressions. A good introduction to this 


topic may be found in [[AHU7A]. 


The Traveling Salesman Problem is to find a minimal length tour among a set of cities, while visiting 
each one only once. 


The Integer Programming Problem is to solve a Linear Programming problem where some or all of the 
variables are restricted to being integers. 


The Graph Coloring Problem is to determine whether a graph can be colored with a fixed set of colors 
such that no two adjacent vertices have the same color, and to produce such a coloring. 


The Hamiltonian Path Problem is to decide if one can traverse a graph by using each vertex exactly 


once. 


The Satisfiability Problem is to determine whether a Boolean expression in several variables has a 
solution. 


Another hard problem is the Knapsack Problem, a narrow case of the Subset Sum Problem (see 
Question 2.3.11). Attempts have been made to make public-key cryptosystems based on the 
knapsack problem, but none have yielded strong results. The Knapsack problem is to determine 
which subset of a set of objects weighing different amounts has maximal total weight, but still has 
total weight less than the capacity of the “Knapsack.” 
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2.4 CRYPTANALYSIS 
2.4.1 Whatis cryptanalysis? 


Cryptanalysis is the flip-side of cryptography: it is the science of cracking codes, decoding secrets, 
violating authentication schemes, and in general, breaking cryptographic protocols. 


In order to design a robust encryption algorithm or cryptographic protocol, one should use 
ctyptanalysis to find and correct any weaknesses. This is precisely the reason why the most trusted 
encryption algorithms are ones that have been made available to public scrutiny. For example, DES 
(see Section has been exposed to public scrutiny for years, and has therefore been well-trusted, 
while Skipjack (see Question B.6.7) was secret for a long time and is less well-trusted. It is a basic 
tenet of cryptology that the security of an algorithm should not rely on its secrecy. Inevitably, the 
algorithm will be discovered and its weaknesses (if any) will be exploited. 


The various techniques in cryptanalysis attempting to compromise cryptosystems ate referred to as 
attacks. Some attacks are general, whereas others apply only to certain types of cryptosystems. Some 
of the better-known attacks are mentioned in Question 2.4.2, 


2.4.2 Whatare some of the basic types of cryptanalytic attack? 


Cryptanalytic attacks are generally classified into six categories that distinguish the kind of information 
the cryptanalyst has available to mount an attack. The categories of attack are listed here roughly 
in increasing order of the quality of information available to the cryptanalyst, or, equivalently, in 
decreasing order of the level of difficulty to the cryptanalyst. The objective of the cryptanalyst in all 
cases is to be able to decrypt new pieces of ciphertext without additional information. The ideal for 
a ctyptanalyst is to extract the secret key. 


A aiphertext-only attack is one in which the cryptanalyst obtains a sample of ciphertext, without 
the plaintext associated with it. This data is relatively easy to obtain in many scenarios, but a 
successful ciphertext-only attack is generally difficult, and requires a very large ciphertext sample. 
A known-plaintext attack is one in which the cryptanalyst obtains a sample of ciphertext and the 
corresponding plaintext as well. 


A chosen-plaintext attack is one in which the cryptanalyst is able to choose a quantity of plaintext and 
then obtain the corresponding encrypted ciphertext. 


An adaptive-chosen-plaintext attack is a special case of chosen-plaintext attack in which the cryptanalyst 
is able to choose plaintext samples dynamically, and alter his or her choices based on the results of 
previous encryptions. 


A chosen-ciphertext attack is one in which cryptanalyst may choose a piece of ciphertext and attempt 
to obtain the corresponding decrypted plaintext. This type of attack is generally most applicable to 
public-key cryptosystems. 


An adaptive-chosen-ciphertext is the adaptive version of the above attack. A cryptanalyst can mount an 
attack of this type in a scenario in which he has free use of a piece of decryption hardware, but is 
unable to extract the decryption key from it. 


Note that cryptanalytic attacks can be mounted not only against encryption algorithms, but also, 
analogously, against digital signature algorithms (see Question 2.2.2), MACing algorithms (see 
Question 2.1.7), and pseudo-random number generators (see Question 2.5.2). 
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2.4.3 Whatis exhaustive key search? 


Exhaustive key search, or brute-force search, is the basic technique of trying every possible key in 
turn until the correct key is identified. To identify the correct key it may be necessary to possess a 
plaintext and its corresponding ciphertext, or if the plaintext has some recognizable characteristic, 
ciphertext alone might suffice. Exhaustive key search can be mounted on any cipher and sometimes 
a weakness in the key schedule (see Question 2.1.4) of the cipher can help improve the efficiency of 
an exhaustive key search attack. 


Advances in technology and computing performance will always make exhaustive key search an 
increasingly practical attack against keys of a fixed length. When DES (see Section was designed, 
it was generally considered secure against exhaustive key search without a vast financial investment 
in hardware [DH77]. Over the years, however, this line of attack will become increasingly attractive 
to a potential adversary [MWie94]. A useful article on exhaustive key search can be found in the Winter 
1997 issue of CryptoBytes [CR97]. 


Exhaustive key search may also be performed in software running on standard desktop workstations 
and personal computers. While exhaustive search of DES’s 56-bit key space would take tens or 
hundreds of years on the fastest general purpose computer available today, the growth of the Internet 
has made it possible to utilize thousands of machines in a distributed search by partitioning the key 
space and distributing small portions to each of a large number of computers. In this manner and 
using a specially designed supercomputer, a DES key was indeed broken in 22 hours in January 1999 


(see Question 2.4.4). 


The current rate of increase in computing power is such that an 80-bit key should offer an acceptable 
level of security for another 10 or 15 years (consider the conservative estimates in [LVOO)]). In the 
mid-20s, however, an 80-bit key will be as vulnerable to exhaustive search as a 64-bit key is today, 
assuming a halved cost of processing power every 18 months. Absent a major breakthrough in 
quantum computing (see Question [7.T7), it is unlikely that 128-bit keys, such as those used in IDEA 
(see Question B.6.7) and the forthcoming AES (see Section B.3), will be broken by exhaustive search 
in the foreseeable future. 


2.4.4 Whatis the RSA Secret Key Challenge? 


RSA Laboratories started the RSA Secret Key Challenge in January 1997. The goal of the challenges 
is to quantify the security offered by secret-key ciphers (see Question with keys of various 
sizes. The information obtained from these contests is anticipated to be of value to researchers 
and developers alike as they estimate the strength of an algorithm or application against exhaustive 
key-search. 


Initially, thirteen challenges were issued, of which four have been solved as of January 2000. There 
were twelve RC5 challenges and one DES challenge, with key sizes ranging from 40 bits to 128 bits. 
The 56-bit DES challenge and the 40-, 48-, and 56-bit RC5 challenges have all been solved. The 
56-bit RC5 key was found in October 1997 after 250 days of exhaustive key search on 10,000 idle 
computers. The project was part of the Bovine RC5 Effort headed by a group called distributed.net 
and led by Adam L. Beberg, Jeff Lawson, and David McNett. 


In January 1998, RSA Laboratories launched the DES challenge I, which consists of a series of DES 
challenges to be released twice per year. It has been expected that each time the amount of time 
needed to solve the challenge will decrease substantially. Indeed, in February 1998, distributed.net 
solved RSA’s DES Challenge H, using an estimated 50,000 processors to search 85% of the possible 
keys, in 41 days. In July 1998, the supercomputer DES Cracker designed by Electronic Frontier 
Foundation (EFF) was able to crack RSA’s DES Challenge 11-2 in 56 hours. The same computer, 
assisted by 100,000 distributed.net PCs on the Internet, was able to crack DES Challenge HI in only 
22 hours; see 


http://www.eff.org/descracker.html]. 


For more information about the challenges, send email to [challenge-info@rsasecurity. con] or 


visit the web site at 


http://www.rsasecurity.com/rsalabs/challenges/]. 
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2.4.5 What are the most important attacks on symmetric block ciphers? 


There are several attacks which ate specific to block ciphers (see Question 2.1.4). Four such attacks 
are differential cryptanalysis, linear cryptanalysis, the exploitation of weak keys, and algebraic attacks. 


Differential cryptanalysis is a type of attack that can be mounted on iterative block ciphers (see 
Question 2.1.4.1] These techniques were first introduced by Murphy [Mur90] in an attack on FEAL-4 
(see Question B.6.7), but they were later improved and perfected by Biham and Shamir [BSYTa] 
[BS93b] who used them to attack DES (see Section B.2). Differential cryptanalysis is basically a 
chosen plaintext attack (see Question 2.4.2); it relies on an analysis of the evolution of the differences 
between two related plaintexts as they are encrypted under the same key. By careful analysis of the 
available data, probabilities can be assigned to each of the possible keys, and eventually the most 
probable key is identified as the correct one. 


Differential cryptanalysis has been used against a great many ciphers with varying degrees of success. 
In attacks against DES, its effectiveness is limited by very careful design of the S-boxes during 
the design of DES in the mid-1970s [Cop92]. Studies on protecting ciphers against differential 
cryptanalysis have been conducted by Nyberg and Knudsen [NKY95] as well as Lai, Massey, and 
Murphy [EMM92]. Differential cryptanalysis has also been useful in attacking other cryptographic 
primitives such as hash functions (see Section 2.1.6). 


Matsui and Yamagishi [MY92] first devised /inear cryptanalysis in an attack on FEAL (see Ques- 
tion B.6.7). It was extended by Matsui [Mat93] to attack DES (see Section B.2). Linear cryptanalysis 
is a known plaintext attack (see Question which uses a linear approximation to describe the 
behavior of the block cipher. Given sufficient pairs of plaintext and corresponding ciphertext, bits of 
information about the key can be obtained, and increased amounts of data will usually give a higher 
probability of success. 


There have been a variety of enhancements and improvements to the basic attack. Langford and 
Hellman [LH94] introduced an attack called differential-linear cryptanalysis that combines elements 
of differential cryptanalysis with those of linear cryptanalysis. Also, Kaliski and Robshaw [IKR94] 
showed that a linear cryptanalytic attack using multiple approximations might allow for a reduction 
in the amount of data required for a successful attack. Other issues such as protecting ciphers against 


linear cryptanalysis have been considered by Nyberg [Nyb95], Knudsen [Knu93], and O’Conner 
[Oco95]. 


Weak keys are secret keys with a certain value for which the block cipher in question will exhibit 
certain regularities in encryption or, in other cases, a poor level of encryption. For instance, with 
DES (see Section B.2), there are four keys for which encryption is exactly the same as decryption. 
This means that if one were to encrypt twice with one of these weak keys, then the original plaintext 
would be recovered. For IDEA (see Question B.6.7), there is a class of keys for which cryptanalysis 
is greatly facilitated and the key can be recovered. However, in both these cases, the number of 
weak keys is such a small fraction of all possible keys that the chance of picking one at random is 
exceptionally slight. In such cases, they pose no significant threat to the security of the block cipher 
when used for encryption. 


Of course, for other block ciphers, there might well be a large set of weak keys (perhaps even with 
the weakness exhibiting itself in a different way) for which the chance of picking a weak key is too 
large for comfort. In such a case, the presence of weak keys would have an obvious impact on the 
security of the block cipher. 


Algebraic attacks are a class of techniques that rely for their success on block ciphers exhibiting a high 
degree of mathematical structure. For instance, it is conceivable that a block cipher might exhibit a 
group structure (see Section [A.3). If this were the case, then encrypting a plaintext under one key 
and then encrypting the result under another key would always be equivalent to single encryption 
under some other single key. If so, then the block cipher would be considerably weaker, and the use 
of multiple encryption would offer no additional security over single encryption; see [KRS88] for a 
more complete discussion. For most block ciphers, the question of whether they form a group is still 
open. DES, however, is known not to be a group; see Question B.2.5} 
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2.4.6 What are some techniques against hash functions? 


The essential cryptographic properties of a hash function are that it is both one-way and collision-free 
(see Question 2.1.0). The most basic attack we might mount on a hash function is to choose inputs 
to the hash function at random until either we find some input that will give us the target output 
value we are looking for (thereby contradicting the one-way property), or we find two inputs that 
produce the same output (thereby contradicting the collision-free property). 


Suppose the hash function produces an n-bit long output. If we are trying to find some input that 
will produce a given target output value y, then since each output is equally likely we expect to have 
to try on the order of 2” possible input values. 


A birthday attack is a name used to refer to a class of brute-force attacks. If some function, when 
supplied with a random input, returns one of k equally-likely values, then by repeatedly evaluating 
the function for different inputs, we expect to obtain the same output after about 1.2k!/? trials. 


If we are trying to find a collision, then by the birthday paradox we would expect that after trying 
1.2(27/2) possible input values we would have some collision. Van Oorschot and Wiener [VW94] 
showed how such a brute-force attack might be implemented. 


With regard to the use of hash functions in the provision of digital signatures, Yuval [Yuv79] 
proposed the following strategy based on the birthday paradox, where n is the length of the message 
digest: 


e The adversary selects two messages: the target message to be signed and an innocuous message 
that Alice is likely to want to sign. 


e The adversary generates 2”/? variations of the innocuous message (by making, for instance, 
minor editorial changes), all of which convey the same meaning, and their corresponding 
message digests. He then generates an equal number of variations of the target message to be 
substituted. 


e The probability that one of the variations of the innocuous message will match one of the 
variations of the target message is greater than 1/2 according to the birthday paradox. 


e The adversary then obtains Alice’s signature on the variation of the innocuous message. 


e The signature from the innocuous message is removed and attached to the variation of the 
target message that generates the same message digest. The adversary has successfully forged 
the message without discovering the enciphering key. 


Pseudo-collisions are collisions for the compression function (see Question 2.1.0) that lies at the heart 
of an iterative hash function. While collisions for the compression function of a hash function might 
be useful in constructing collisions for the hash function itself, this is not normally the case. While 
pseudo-collisions might be viewed as an unfortunate property of a hash function, a pseudo-collision 
is not equivalent to a collision -- the hash function may still be considered as reasonably secure, 
though its use for new applications tends to be discouraged in favor of pseudo-collision-free hash 
functions. MD5 (see Question B.6.6) is one such example. 


2.4.7 What are the most important attacks on stream ciphers? 


The most typical use of a stream cipher for encryption is to generate a keystream in a way that 
depends on the secret key and then to combine this (typically using bitwise XOR) with the message 
being encrypted. 


lt is imperative the keystream “looks” random; that is, after seeing increasing amounts of the 
keystream, an adversary should have no additional advantage in being able to predict any of the 
subsequent bits of the sequence. While there are some attempts to guarantee this property in a 
provable way, most stream ciphers rely on ad hoc analysis. A necessary condition for a secure stream 
cipher is that it pass a battery of statistical tests which assess (among other things) the frequencies 
with which individual bits or consecutive patterns of bits of different sizes occur. Such tests might 
also check for correlation between bits of the sequence occurring at some time instant and those at 
other points in the sequence. Clearly the amount of statistical testing will depend on the thoroughness 
of the designer. It is a very rare and very poor stream cipher that does not pass most suites of 
statistical tests. 


A keystream might potentially have structural weaknesses that allow an adversary to deduce some 
of the keystream. Most obviously, if the period of a keystream, that is, the number of bits in the 
keystream before it begins to repeat again, is too short, the adversary can apply discovered parts of 
the keystream to help in the decryption of other parts of the ciphertext. A stream cipher design 
should be accompanied by a guarantee of the minimum period for the keystreams that might be 
generated or alternatively, good theoretical evidence for the value of the lower bound to such a 
period. Without this, the user of the cryptosystem cannot be assured that a given keystream will not 
repeat far sooner than might be required for cryptographic safety. 


A more involved set of structural weaknesses might offer the opportunity of finding alternative ways 
to generate part or even the whole of the keystream. Chief among these approaches might be using 
a linear feedback shift register to replicate part of the sequence. The motivation to use a linear feedback 
shift register is due to an algorithm of Berlekamp and Massey that takes as input a finite sequence 
of bits and generates as output the details of a linear feedback shift register that could be used to 
generate that sequence. This gives rise to the measure of security known as the near complexity of a 
sequence; for a given sequence, the linear complexity is the size of the linear feedback shift register 
that needs to be used to replicate the sequence. Clearly a necessary condition for the security of a 
stream cipher is that the sequences it produces have a high linear complexity. RSA Laboratories 
Technical Report TR-801 [Koc95] describes in more detail some of these issues and also some of 
the other alternative measures of complexity that might be of interest to the cryptographer and 
cryptanalyst. 


Other attacks attempt to recover part of the secret key that was used. Apart from the most obvious 
attack of searching for the key by brute force, a powerful class of attacks can be described by the 
term divide and conquer. During off-line analysis the cryptanalyst identifies some part of the key 
that has a direct and immediate effect on some aspect or component of the generated keystream. By 
performing a brute-force search over this smaller part of the secret key and observing how well the 
sequences generated match the real keystream, the cryptanalyst can potentially deduce the correct 
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value for this smaller fraction of the secret key [Koc95]. This correlation between the keystream 
produced after making some guess to part of the key and the intercepted keystream gives rise to what 
are termed correlation attacks and later the more efficient fast correlation attacks. 


Finally there are some implementation considerations. A synchronous stream cipher allows an 
adversary to change bits in the plaintext without any error-propagation to the rest of the message. If 
authentication of the message being encrypted is required, the use of a cryptographic MAC might be 
advisable. As a separate implementation issue synchronization between sender and receiver might 
sometimes be lost with a stream cipher and some method is required is ensure the keystreams can 
be put back into step. One typical way of doing this is for the sender of the message to intersperse 
synchronization markers into the transmission so only that part of the transmission which lies 
between synchronization markers might be lost. This process however does carry some security 
implications. 


2.4.8 What are the most important attacks on MACs? 


There are a variety of threats to the security of a MAC (see Question 2.1.7). First and most obviously, 
the use of a MAC should not reveal information about the secret key being used. Second, it should 
not be possible for an adversary to forge the correct MAC to some message without knowing the 
secret key -- even after seeing many legitimate message/MAC pairs. Third, it should not be possible 
to replace the message in a message/MAC pair with another message for which the MAC remains 
legitimate. There ate a variety of threat models that depend on different assumptions about the 
data that might be collected. For example, can an adversary control the messages whose MACs are 
obtained, and if so, can the choice be adapted as more data is collected? 


Depending on the design of the MAC there are a variety of different attacks that might apply. Perhaps 
the most important class of attacks is due to Preneel and van Oorschot [PV95]. These attacks 
involve a sophisticated application of the birthday paradox (see Question 2.4.6) to the analysis of 
message/MAC pairs and the attacks have been particularly useful in highlighting structural faults in 
the design of many MACs. Some considerable work was spent in the early to mid-90’s on designing 
MACs based around the use of a hash function. The attacks of Preneel and van Oorschot were 
instrumental in removing many of these flawed designs from consideration. 
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2.4.9 At what point does an attack become practical? 


There is no easy answer to this question as the answer depends on many distinct factors. Not only 
must the work and computational resources required by the cryptanalyst be reasonable, but the 
amount and type of data required for the attack to be successful must also be taken into account. 
Furthermore, the value of the concealed information must be taken into account -- it is reasonable to 
spend a million dollars of effort to uncover something worth more than a million dollars, however, 
any sane attacker would not, for example, invest one million dollars to uncover a secret worth one 


thousand dollars. 


Also, it should be noted that cryptography and security are not equivalent. If a block cipher takes 
seven months of computational effort to crack, but the key can be recovered by bribery or extortion, 


a truly dedicated adversary will probably attempt the latter. 


2.5 SUPPORTING TOOLS IN CRYPTOGRAPHY 
2.5.1 Whatis primality testing? 


Primality testing is the process of proving a number is prime (an integer greater than 1 is prime if it is 
divisible only by itself and 1). It is used in the key generation process for cryptosystems that depend 
on secret prime numbers, such as the RSA system. Probabilistic primality testing is a process that 
proves a number has a high probability of being prime. 


To generate a random prime number, random numbers are generated (see Question 2.5.2) and tested 
for primality until one of them is found to be prime (or very likely to be prime, if probabilistic testing 
is used). 


It is generally recommended to use probabilistic primality testing, which is much quicker than actually 
proving a number is prime. One can use a probabilistic test that determines whether a number is 
prime with arbitrarily small probability of error, say, less than 27100, For further discussion of some 
primality testing algorithms, see [BBC88]. For some empirical results on the reliability of simple 
primality tests, see [Riv9Tal; one can perform very fast primality tests and be extremely confident in 
the results. A simple algorithm for choosing probable primes was analyzed by Brandt and Damgard 


(BD93H). 
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2.5.2 Whatis random number generation? 


Random number generation is used in a wide variety of cryptographic operations, such as key 
generation and challenge/response protocols. A random number generator is a function that outputs 
a sequence of Os and 1s such that at any point, the next bit cannot be predicted based on the previous 
bits. However, true random number generation is difficult to do on a computer, since computers ate 
deterministic devices. Thus, if the same random generator is run twice, identical results are received. 
True random number generators are in use, but they can be difficult to build. They typically take 
input from something in the physical world, such as the rate of neutron emission from a radioactive 
substance or a user's idle mouse movements. 


Because of these difficulties, random number generation on a computer is usually only pseudo- 
random number generation. A pseudo-random number generator produces a sequence of bits that 
has a random looking distribution. With each different seed (a typically random stream of bits used to 
generate a usually longer pseudo-random stream), the pseudo-random number generator generates a 
different pseudo-random sequence. With a relatively small random seed a pseudo-random number 
generator can produce a long apparently random string. 


Pseudo-random number generators are often based on cryptographic functions like block ciphers 
or stream ciphers. For instance, iterated DES encryption starting with a 56-bit seed produces a 
pseudo-random sequence. 


CHAPTER 3 
Techniques in Cryptography 


Cryptographic algorithms are the basic building blocks of cryptographic applications and protocols. 
This chapter presents most of the important encryption algorithms, hash functions, stream ciphers, 
and other basic cryptographic algorithms. 


3.1 RSA 
3.1.1 Whatis the RSA cryptosystem? 


The RSA cryptosystem is a public-key cryptosystem that offers both encryption and digital signatures 
(authentication). Ronald Rivest, Adi Shamir, and Leonard Adleman developed the RSA system in 
1977 [RSA78]; RSA stands for the first letter in each of its inventors’ last names. 


The RSA algorithm works as follows: take two large primes, p and q, and compute their product 
n = pq; n is called the modulus. Choose a number, e, less than n and relatively prime to (p—1)(q—1), 
which means e and (p — 1)(q — 1) have no common factors except 1. Find another number d such 
that (ed — 1) is divisible by (p — 1)(q — 1). The values e and d are called the public and private 
exponents, respectively. The public key is the pair (n, e); the private key is (n, d). The factors p and q 
may be destroyed or kept with the private key. 


It is currently difficult to obtain the private key d from the public key (n, e). However if one could 
factor n into p and q, then one could obtain the private key d. Thus the security of the RSA system 
is based on the assumption that factoring is difficult. The discovery of an easy method of factoring 


would “break” RSA (see Question B.T.3 and Question 2.3.3). 


Here is how the RSA system can be used for encryption and digital signatures (in practice, the actual 


use is slightly different; see Questions B.T-7] and B.T.8): 


Encryption 
Suppose Alice wants to send a message m to Bob. Alice creates the ciphertext c by exponentiating: 
c = mê mod n, where e and n are Bob's public key. She sends c to Bob. To decrypt, Bob also 


d 


exponentiates: m = c| mod n; the relationship between e and d ensures that Bob correctly recovers 


m. Since only Bob knows d, only Bob can decrypt this message. 


Digital Signature 

Suppose Alice wants to send a message m to Bob in such a way that Bob is assured the message is 
both authentic, has not been tampered with, and from Alice. Alice creates a digital signature s by 
exponentiating: s = m? mod n, where d and n are Alice’s private key. She sends m and s to Bob. To 


#3 


74 


Frequently Asked Questions About Today's Cryptography / Chapter 3 


verify the signature, Bob exponentiates and checks that the message m is recovered: m = sê mod n, 
where e and n are Alice’s public key. 


Thus encryption and authentication take place without any sharing of private keys: each person uses 
only another’s public key or their own private key. Anyone can send an encrypted message or verify 


a signed message, but only someone in possession of the correct private key can decrypt or sign a 
message. 


3.1.2 How fastis the RSA algorithm? 


An “RSA operation,” whether encrypting, decrypting, signing, or verifying is essentially a modular 
exponentiation. This computation is performed by a series of modular multiplications. 


In practical applications, it is common to choose a small public exponent for the public key. In fact, 
entire groups of users can use the same public exponent, each with a different modulus. (There are 
some restrictions on the prime factors of the modulus when the public exponent is fixed.) This 
makes encryption faster than decryption and verification faster than signing. With the typical modular 
exponentiation algorithms used to implement the RSA algorithm, public key operations take O(k?) 
steps, private key operations take O(k*) steps, and key generation takes O(k*) steps, where k is the 
number of bits in the modulus. “Fast multiplication” techniques, such as methods based on the Fast 
Fourier Transform (FFT), require asymptotically fewer steps. In practice, however, they are not as 
common due to their greater software complexity and the fact that they may actually be slower for 
typical key sizes. 


The speed and efficiency of the many commercially available software and hardware implementations 


of the RSA algorithm are increasing rapidly; see for the latest 
figures. 


By comparison, DES (see Section and other block ciphers are much faster than the RSA 
algorithm. DES is generally at least 100 times as fast in software and between 1,000 and 10,000 times 
as fast in hardware, depending on the implementation. Implementations of the RSA algorithm will 
probably narrow the gap a bit in coming years, due to high demand, but block ciphers will get faster 
as well. 


For a detailed report on high-speed RSA algorithm implementations, see [Koc94]. 
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3.1.3 What would it take to break the RSA cryptosystem? 


There ate a few possible interpretations of “breaking” the RSA system. The most damaging would 
be for an attacker to discover the private key corresponding to a given public key; this would enable 
the attacker both to read all messages encrypted with the public key and to forge signatures. The 
obvious way to do this attack is to factor the public modulus, n, into its two prime factors, p and 
q. From p, q, and e, the public exponent, the attacker can easily get d, the private exponent. The 
hard part is factoring n; the security of RSA depends on factoring being difficult. In fact, the task of 
recovering the private key is equivalent to the task of factoring the modulus: you can use d to factor 
n, as well as use the factorization of n to find d (see Questions 2.3.4 and 2.3.5] regarding the state of 
the art in factoring). It should be noted that hardware improvements alone will not weaken the RSA 
cryptosystem, as long as appropriate key lengths are used. In fact, hardware improvements should 
increase the security of the cryptosystem (again, see Question 2.3.5). 


Another way to break the RSA cryptosystem is to find a technique to compute eth roots mod n. 
Since c = mê mod n, the eth root of c mod n is the message m. This attack would allow someone to 
recover encrypted messages and forge signatures even without knowing the private key. This attack 
is not known to be equivalent to factoring. No general methods are currently known that attempt 
to break the RSA system in this way. However, in special cases where multiple related messages are 
encrypted with the same small exponent, it may be possible to recover the messages. 


The attacks just mentioned are the only ways to break the RSA cryptosystem in such a way as to be 
able to recover all messages encrypted under a given key. There are other methods, however, that 
aim to recover single messages; success would not enable the attacker to recover other messages 
encrypted with the same key. Some people have also studied whether part of the message can be 
recovered from an encrypted message [ACG84]. 


The simplest single-message attack is the guessed plaintext attack. An attacker sees a ciphertext and 
guesses that the message might be, for example, “Attack at dawn,” and encrypts this guess with the 
public key of the recipient and by comparison with the actual ciphertext, the attacker knows whether 
or not the guess was correct. Appending some random bits to the message can thwart this attack. 
Another single-message attack can occur if someone sends the same message m to three others, who 
each have public exponent e = 3. An attacker who knows this and sees the three messages will be 
able to recover the message m. This attack, and ways to prevent it, are discussed by Hastad [[Has88]. 
Fortunately, this attack can also be defeated by padding the message before each encryption with 
some random bits. There ate also some chosen ciphertext attacks (or chosen message attacks for 
signature forgery), in which the attacker creates some ciphertext and gets to see the corresponding 
plaintext, perhaps by tricking a legitimate user into decrypting a fake message (Davida [Dav8A] and 
Desmedt and Odlyzko [DO86] give some examples). 


For a survey of these and other attacks on the RSA cryptosystem, see [KR95d]. 


Of course, there are also attacks that aim not at the cryptosystem itself but at a given insecure 
implementation of the system; these do not count as “breaking” the RSA system, because it is 
not any weakness in the RSA algorithm that is exploited, but rather a weakness in a specific 


implementation. For example, if someone stores a private key insecurely, an attacker may discover 
it. One cannot emphasize strongly enough that to be truly secure, the RSA cryptosystem requires 
a secure implementation; mathematical security measures, such as choosing a long key size, are not 
enough. In practice, most successful attacks will likely be aimed at insecure implementations and 
at the key management stages of an RSA system. See Section f-1.3| for a discussion of secure key 
management in an RSA system. 
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3.1.4 What are strong primes and are they necessary for the RSA system? 


In the literature pertaining to the RSA algorithm, it has often been suggested that in choosing a key 
pair, one should use so-called “strong” primes p and q to generate the modulus n. Strong primes 
have certain properties that make the product n hard to factor by specific factoring methods; such 
properties have included, for example, the existence of a large prime factor of p — 1 and a large prime 
factor of p + 1. The reason for these concerns is that some factoring methods -- for instance, the 
Pollard p — 1 and p+ 1 methods (see Question 2.3.4) -- are especially suited to primes p such that 
p—1orp-+1 has only small factors; strong primes are resistant to these attacks. Strong primes are 
required in for example ANSI X9.31 (see Question B.3.1). 


However, advances in factoring over the last ten years appear to have obviated the advantage of 
strong primes; the elliptic curve factoring algorithm is one such advance. The new factoring methods 
have as good a chance of success on strong primes as on “weak” primes. Therefore, choosing 
traditional “strong” primes alone does not significantly increase security. Choosing large enough 
primes is what matters. However, there is no danger in using strong, large primes, though it may take 
slightly longer to generate a strong prime than an arbitrary prime. 


It is possible that new factoring algorithms may be developed in the future which once again target 
primes with certain properties. If this happens, choosing strong primes may once again help to 
increase security. 


3.1.5 How large a key should be used in the RSA cryptosystem? 


The size of a key in the RSA algorithm typically refers to the size of the modulus n. The two primes, 
p and q, which compose the modulus, should be of roughly equal length; this makes the modulus 
harder to factor than if one of the primes is much smaller than the other. If one chooses to use a 
768-bit modulus, the primes should each have length approximately 384 bits. If the two primes are 
extremely closd]] or their difference is close to any predetermined amount, then there is a potential 
security risk, but the probability that two randomly chosen primes are so close is negligible. 


The best size for a modulus depends on one’s security needs. The larger the modulus, the greater the 
security, but also the slower the RSA algorithm operations. One should choose a modulus length 
upon consideration, first, of the value of the protected data and how long it needs to be protected, 
and, second, of how powerful one’s potential threats might be. 


A good analysis of the security obtained by a given modulus length is given by Rivest [Riv92al], in 
the context of discrete logarithms modulo a prime, but it applies to the RSA algorithm as well. A 
more recent study of RSA key-size security can be found in an article by Odlyzko [Od195]. Odlyzko 
considers the security of RSA key sizes based on factoring techniques available in 1995 and on 
potential future developments, and he also considers the ability to tap large computational resources 
via computer networks. In 1997, a specific assessment of the security of 512-bit RSA keys shows that 
one may be factored for less than $1,000,000 in cost and eight months of effort [Rob95d]. Indeed, 
the 512-bit number RSA-155 was factored in seven months during 1999 (see Question 2.3.0). This 
means that 512-bit keys no longer provide sufficient security for anything more than very short-term 
security needs. 


RSA Laboratories currently recommends key sizes of 1024 bits for corporate use and 2048 bits for 
extremely valuable keys like the root key pair used by a certifying authority (see Question K.1.3.12). 
Several recent standards specify a 1024-bit minimum for corporate use. Less valuable information 
may well be encrypted using a 768-bit key, as such a key is still beyond the reach of all known key 
breaking algorithms. Lenstra and Verheul [LVOO] give a model for estimating security levels for 
different key sizes, which may also be considered. 


It is typical to ensure that the key of an individual user expires after a certain time, say, two years (see 
Question f.1.3.5). This gives an opportunity to change keys regularly and to maintain a given level 
of security. Upon expiration, the user should generate a new key being sure to ascertain whether any 
changes in cryptanalytic skills make a move to longer key lengths appropriate. Of course, changing a 
key does not defend against attacks that attempt to recover messages encrypted with an old key, so 
key size should always be chosen according to the expected lifetime of the data. The opportunity 
to change keys allows one to adapt to new key size recommendations. RSA Laboratories publishes 
recommended key lengths on a regular basis. 


Users should keep in mind that the estimated times to break the RSA system are averages only. A 
large factoring effort, attacking many thousands of moduli, may succeed in factoring at least one in 


2 
lput m= pia With p < q, we have 0 < m — yn < HQ Since p = m+ Vm? — n, the primes p and q can 
be easily determined if the difference p — q is small. 
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a reasonable time. Although the security of any individual key is still strong, with some factoring 
methods there is always a small chance the attacker may get lucky and factor some key quickly. 


As for the slowdown caused by increasing the key size (see Question B.T.2), doubling the modulus 
length will, on average, increase the time required for public key operations (encryption and signature 
verification) by a factor of four, and increase the time taken by private key operations (decryption 
and signing) by a factor of eight. The reason public key operations are affected less than private key 
operations is that the public exponent can remain fixed while the modulus is increased, whereas the 
length of the private exponent increases proportionally. Key generation time would increase by a 
factor of 16 upon doubling the modulus, but this is a relatively infrequent operation for most users. 


It should be noted that the key sizes for the RSA system (and other public-key techniques) are much 
larger than those for block ciphers like DES (see Section B.2), but the security of an RSA key cannot 
be compared to the security of a key in another system purely in terms of length. 


3.1.6 Could users ofthe RSA system run out of distinct primes? 


As Euclid proved over two thousand years ago, there are infinitely many prime numbers. Because 
the RSA algorithm is generally implemented with a fixed key length, however, the number of primes 
available to a user of the algorithm is effectively finite. Although finite, this number is nonetheless 
very large. The Prime Number Theorem states that the number of primes less than or equal to n 
is asymptotic to n/Inn. Hence, the number of prime numbers of length 512 bits or less is roughly 
10!°°. This is greater than the number of atoms in the known universe. 
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3.1.7 How is the RSA algorithm used for privacy in practice? 


In practice, the RSA system is often used together with a secret-key cryptosystem, such as DES (see 
Section B.2), to encrypt a message by means of an RSA digital envelope (see Question 2.2.4). 


Suppose Alice wishes to send an encrypted message to Bob. She first encrypts the message with 
DES, using a randomly chosen DES key. Then she looks up Bob’s public key and uses it to encrypt 
the DES key. The DES-encrypted message and the RSA-encrypted DES key together form the RSA 
digital envelope and are sent to Bob. Upon receiving the digital envelope, Bob decrypts the DES key 
with his private key, then uses the DES key to decrypt the message itself. This combines the high 
speed of DES with the key management convenience of the RSA system. 


3.1.8 How is the RSA algorithm used for authentication and 


digital signatures in practice? 


The RSA public-key cryptosystem can be used to authenticate (see Question 2.2.2) or identify another 
person or entity. The reason it works well is because each entity has an associated private key which 
(theoretically) no one else has access to. This allows for positive and unique identification. 


Suppose Alice wishes to send a signed message to Bob. She applies a hash function (see Question 2.1.6) 
to the message to create a message digest, which serves as a “digital fingerprint” of the message. 
She then encrypts the message digest with her private key, creating the digital signature she sends 
to Bob along with the message itself. Bob, upon receiving the message and signature, decrypts the 
signature with Alice’s public key to recover the message digest. He then hashes the message with 
the same hash function Alice used and compares the result to the message digest decrypted from 
the signature. If they are exactly equal, the signature has been successfully verified and he can be 
confident the message did indeed come from Alice. If they are not equal, then the message either 
originated elsewhere or was altered after it was signed, and he rejects the message. Anybody who 
reads the message can verify the signature. This does not satisfy situations where Alice wishes to 
retain the secrecy of the document. In this case she may wish to sign the document, then encrypt it 
using Bob’s public key. Bob will then need to decrypt using his private key and verify the signature 
on the recovered message using Alice’s public key. Alternately, if it is necessary for intermediary third 
patties to validate the integrity of the message without being able to decrypt its content, a message 
digest may be computed on the encrypted message, rather than on its plaintext form. 


In practice, the public exponent in the RSA algorithm is usually much smaller than the private 
exponent. This means that verification of a signature is faster than signing. This is desirable because 
a message will be signed by an individual only once, but the signature may be verified many times. 


It must be infeasible for anyone to either find a message that hashes to a given value or to find 
two messages that hash to the same value. If either were feasible, an intruder could attach a false 
message onto Alice’s signature. Hash functions such as MD5 and SHA (see Question [3.6.0 and 
Question B.6.5) have been designed specifically to have the property that finding a match is infeasible, 
and are therefore considered suitable for use in cryptography. 


One or more certificates (see Question #.1.3.10) may accompany a digital signature. A certificate 
is a signed document that binds the public key to the identity of a party. Its purpose is to prevent 
someone from impersonating someone else. If a certificate is present, the recipient (or a third party) 
can check that the public key belongs to a named party, assuming the certifier’s public key is itself 
trusted. 
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3.1.9 Is the RSA cryptosystem currently in use? 


The RSA system is currently used in a wide variety of products, platforms, and industries around 
the world. It is found in many commercial software products and is planned to be in many more. 
The RSA algorithm is built into current operating systems by Microsoft, Apple, Sun, and Novell. In 
hardware, the RSA algorithm can be found in secure telephones, on Ethernet network cards, and 
on smart cards. In addition, the algorithm is incorporated into all of the major protocols for secure 
Internet communications, including S/MIME (see Question 6.1.1), SSL (see Question 6.1.2), and 
S/WAN (see Question B.1.3). It is also used internally in many institutions, including branches of 
the U.S. government, major corporations, national laboratories, and universities. 


At the time of this publication, technology using the RSA algorithm is licensed by over 700 companies. 
The estimated installed base of RSA BSAFE encryption technologies is around 500 million. The 
majority of these implementations include use of the RSA algorithm, making it by far the most widely 
used public-key cryptosystem in the world. This figure is expected to grow rapidly as the Internet 
and the World Wide Web expand. For a list of RSA algorithm licensees, see 


http://www.rsasecurity.com 


3.1.10 Is the RSA system an official standard today? 


The RSA cryptosystem is part of many official standards worldwide. The ISO (International 
Standards Organization) 9796 standard lists RSA as a compatible cryptographic algorithm, as does 
the ITU-T X.509 security standard (see Question 6.3.2). The RSA systemm is part of the Society 
for Worldwide Interbank Financial Telecommunications (SWIFT) standard, the French financial 
industry’s ETEBAC 5 standard, the ANSI X9.31 rDSA standard and the X9.44 draft standard for the 
U.S. banking industry (see Question 6.3.1). The Australian key management standard, AS2805.6.5.3, 
also specifies the RSA system. 


The RSA algorithm is found in Internet standards and proposed protocols including S/MIME (see 
Question 6.1.1), IPSec (see Question 6.1.4), and TLS (the Internet standards-track successor to SSL; 
see Question 5.1.2), as well as in the PKCS standard (see Question 5.3.3) for the software industry. 
The OSI Implementers’ Workshop (OIW) has issued implementers’ agreements referring to PKCS, 
which includes RSA. 


A number of other standards ate currently being developed and will be announced over the next 
few years; many are expected to include the RSA algorithm as either an endorsed or a recommended 
system for privacy and/or authentication. For example, IEEE P1363 (see Question B.3.5) and WAP 
WTES (see Question includes the RSA system. 


A comprehensive survey of cryptography standards can be found in publications by Kaliski [Ka193B] 
and Ford [For94]. 
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3.1.11 Is the RSA system a de facto standard? 


The RSA system is the most widely used public-key cryptosystem today and has often been called a de 
facto standard. Regardless of the official standards, the existence of a de facto standard is extremely 
important for the development of a digital economy. If one public-key system is used everywhere for 
authentication, then signed digital documents can be exchanged between users in different nations 
using different software on different platforms; this interoperability is necessary for a true digital 
economy to develop. Adoption of the RSA system has grown to the extent that standards are being 
written to accommodate it. When the leading vendors of U.S. financial industry were developing 
standards for digital signatures, they first developed ANSI X9.30 (see Question 6.3.1) in 1997 to 
support the federal requirement of using the Digital Signature Standard (see Section B.4). One year 
later they added ANSI X9.31, whose emphasis is on RSA digital signatures to support the de facto 
standard of financial institutions. 


The lack of secure authentication has been a major obstacle in achieving the promise that computers 
would replace paper; paper is still necessary almost everywhere for contracts, checks, official letters, 
legal documents, and identification. With this core of necessary paper transaction, it has not been 
feasible to evolve completely into a society based on electronic transactions. A digital signature is 
the exact tool necessary to convert the most essential paper-based documents to digital electronic 
media. Digital signatures make it possible for passports, college transcripts, wills, leases, checks and 
voter registration forms to exist in the electronic form; any paper version would just be a “copy” of 
the electronic original. The accepted standard for digital signatures has enabled all of this to happen. 


3.2 DES 
3.2.1 Whatis DES? 


DES, an acronym for the Data Encryption Standard, is the name of the Federal Information 
Processing Standard (FIPS) 46-3, which describes the data encryption algorithm (DEA). The DEA 
is also defined in the ANSI standard X9.32. 


DEA is an improvement of the algorithm Lucifer developed by IBM in the early 1970s. While the 
algorithm was essentially designed by IBM, the NSA (see Question and NBS (now NIST; see 
Question played a substantial role in the final stages of the development. The DEA, often 
called DES, has been extensively studied since its publication and is the best known and widely used 
symmetric algorithm in the world. 


The DEA has a 64-bit block size (see Question 2.T.4) and uses a 56-bit key during execution (8 parity 
bits are stripped off from the full 64-bit key). The DEA is a symmettic cryptosystem, specifically 
a 16-round Feistel cipher (see Question 2.1.4) and was originally designed for implementation in 
hardware. When used for communication, both sender and receiver must know the same secret 
key, which can be used to encrypt and decrypt the message, or to generate and verify a message 
authentication code (MAC). The DEA can also be used for single-user encryption, such as to store 
files on a hard disk in encrypted form. In a multi-user environment, secure key distribution may be 
difficult; public-key cryptography provides an ideal solution to this problem (see Question 2.1.3). 


NIST (see Question 6.2.1) has re-certified DES (FIPS 46-1, 46-2, 46-3) every five years. FIPS 46-3 
reaffirms DES usage as of October 1999, but single DES is permitted only for legacy systems. FIPS 
46-3 includes a definition of triple-DES (IDEA, corresponding to X9.52); TDEA is "the FIPS 
approved symmetric algorithm of choice." Within a few years, DES and triple-DES will be replaced 
with the Advanced Encryption Standard (AES, see Section B.3). 


87 


88 


Frequently Asked Questions About Today's Cryptography / Chapter 3 


3.2.2 Has DES been broken? 


No easy attack on DES has been discovered, despite the efforts of researchers over many years. 
The obvious method of attack is a brute-force exhaustive search of the key space; this process takes 
255 steps on average. Early on, it was suggested [DH77] that a rich and powerful enemy could 
build a special-purpose computer capable of breaking DES by exhaustive search in a reasonable 
amount of time. Later, Hellman [Hel80] showed a time-memory tradeoff that allows improvement 
over exhaustive search if memory space is plentiful. These ideas fostered doubts about the security 
of DES. There were also accusations the NSA (see Question had intentionally weakened 
DES. Despite these suspicions, no feasible way to break DES faster than exhaustive search (see 
Question has been discovered. The cost of a specialized computer to perform exhaustive 
search (requiring 3.5 hours on average) has been estimated by Wiener at one million dollars [W1e94]. 
This estimate was recently updated by Wiener [M1e98] to give an average time of 35 minutes for the 
same cost machine. 


The first attack on DES that is better than exhaustive search in terms of computational requirements 
was announced by Biham and Shamir [BS9a] using a new technique known as differential 
cryptanalysis (see Question 2.4.5). This attack requires the encryption of 2% chosen plaintexts 
(see Question 2.4.2); that is, the plaintexts are chosen by the attacker. Although it is a theoretical 
breakthrough, this attack is not practical because of both the large data requirements and the difficulty 
of mounting a chosen plaintext attack. Biham and Shamir have stated they consider DES secure. 


More recently Matsui [Mat94] has developed another attack, known as linear cryptanalysis (see 
Question 2.4.5). By means of this method, a DES key can be recovered by the analysis of 2% 
known plaintexts. The first experimental cryptanalysis of DES, based on Matsui’s discovery, was 
successfully achieved in an attack requiring 50 days on 12 HP 9735 workstations. Clearly, this attack 
is still impractical. 


Most recently, a DES cracking machine was used to recover a DES key in 22 hours; see Question 2.4.4 
The consensus of the cryptographic community is that DES is not secure, simply because 56 bit 
keys are vulnerable to exhaustive search. In fact, DES is no longer allowed for U.S. government use; 
triple-DES (see Question B.2.6) is the encryption standard until AES (see Section B.3) is ready for 
general use. 


3.2.3 How does one use DES securely? 


When using DES, there are several practical considerations that can affect the security of the encrypted 
data. One should change DES keys frequently, in order to prevent attacks that require sustained 
data analysis. In a communications context, one must also find a secure way of communicating the 
DES key from the sender to the receiver. Use of the RSA algorithm (see Section P.T) or some other 
public-key technique for key management solves both these issues: a different DES key is generated 
for each session, and secure key management is provided by encrypting the DES key with the 
receiver's public key. The RSA system, in this circumstance, can be regarded as a tool for improving 
the security of DES (or any other secret-key cipher). 


If one wishes to use DES to encrypt files stored on a hard disk, it is not feasible to frequently change 
the DES keys, as this would entail decrypting and then re-encrypting all files upon each key change. 
Instead, one might employ a master DES key that encrypts the list of DES keys used to encrypt 
the files; one can then change the master key frequently without much effort. Since the master key 
provides a more attractive point of attack than the individual DES keys used on a per file basis, it 
might be prudent to use triple-DES (see Question B.2.6) as the encryption mechanism for protecting 
the file encryption keys. 


DES can be used for encryption in several officially defined modes (see Question 2.1.4), and these 
modes have a variety of properties. ECB (electronic codebook) mode simply encrypts each 64-bit 
block of plaintext one after another under the same 56-bit DES key. In CBC (cipher block chaining) 
mode, each 64-bit plaintext block is bitwise XORed with the previous ciphertext block before being 
encrypted with the DES key. Thus, the encryption of each block depends on previous blocks and 
the same 64-bit plaintext block can encrypt to different ciphertext blocks depending on its context 
in the overall message. CBC mode helps protect against certain attacks, but not against exhaustive 
search or differential cryptanalysis. CFB (cipher feedback) mode allows one to use DES with block 
lengths less than 64 bits. Detailed descriptions of the various DES modes can be found in [NIS80]. 
The OFB mode essentially allows DES to be used as a stream cipher. 


In practice, CBC is the most widely used mode of DES, and it is specified in several standards. For 
additional security, one could use triple encryption with CBC (see Question B.2.6). 
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3.2.4 Should one test for weak keys in DES? 


DES has four weak keys k for which E, (E, (m)) = m (see Question 2.4.5). There are also twelve 
semi-weak keys which come in pairs kı and kz and are such that Ez, (Eka (m)) = m. Since there are 
2% possible DES keys the chance of picking a weak or semi-weak key at random is 279, As long 
as the user-provided key is chosen entirely at random, weak keys can be safely ignored when DES 
is used for encryption. Despite this, some users prefer to test whether a key to be used for DES 
encryption is in fact a weak key. Such a test will have no significant impact on the time required for 


encryption. 


3.2.5 Is DES a group? 


The question here is whether, for two arbitrary keys kı and kg, there is always a third key k such that 
Ej (m) = Ex, (Ex, (m)) 


for all messages m. If this were the case, the set of all keys would form an abstract group, where 
the composition law on kı and kə yields k. This would be very harmful to the security of DES, 
as it would enable a meet-in-the-middle attack whereby a DES key could be found in about 228 
operations, rather than the usual 2% operations (see [KRS88)). It would also render multiple DES 
encryption useless, since encrypting twice with two different keys would be the same as encrypting 
once with a third key. However, DES is not a group. This issue, while strongly supported by initial 
evidence, was finally settled in 1993 [CW93]. The result seems to imply that techniques such as triple 
encryption (see Question B.2.6) do in fact increase the security of DES. 


Formally, the problem can be formulated as follows; see Appendix [A] for mathematical concepts. Let 
M denote the set of all possible messages and let K denote the set of all possible keys. Encryption 
with a key k € K is performed using the permutation Ep : M — M. The set Ex = {Ep : k € K} of 
such permutations is a subset of the group Sm of all permutations M — M. The fact that Ex (that 
is, DES) is not a group is just the fact that Ex generates a subgroup of Sy that is larger than the 
set Ex. In fact, the size of this subgroup is at least 2830 [CW93]. In particular, multiple encryption 
gives a larger key space. 
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3.2.6 Whatis triple-DES? 


For some time it has been common practice to protect information with triple-DES instead of DES. 
This means that the input data is, in effect, encrypted three times. There are a variety of ways of 
doing this; the ANSI X9.52 standard (see Question B.3.1)) defines triple-DES encryption with keys 
kı, ka, kg as 

C = Ex, (Dr, (Ex, (M))), 


where Ej and Dp denote DES encryption and DES decryption, respectively, with the key k. This 
mode of encryption is sometimes referred to as DES-EDE. Another variant is DES-EEE, which 
consists of three consecutive encryptions. There are three keying options defined in ANSI X9.52 for 
DES-EDE: 


e The three keys k1, k2 and kg are independent. 


e kı and k are independent, but kı = kz. 


The third option makes triple-DES backward compatible with DES. 


Like all block ciphers, triple-DES can be used in a variety of modes. ANSI X9.52 details seven such 
modes, including the four standard modes described in Questions 


The use of double and triple encryption does not always provide the additional security that might 
be expected. For example, consider the following meet-in-the-middle attack on double encryption 
[DH77]. We have a symmetric block cipher with key size n; let E(P) denote the encryption of the 
message P using the key k. Double encryption with two different keys gives a total key size of 2n. 
However, suppose that we are capable of storing Ex(P) for all keys k and a given plaintext P, and 
suppose further that we are given a ciphertext C such that C = Ex, (Ex, (P)) for some secret keys 
kı and kg. For each key l, there is exactly one key k such that D¡(C) = Ex(P). In particular, there 
are exactly 2” possible keys yielding the pair (P, C), and those keys can be found in approximately 
O(2”) steps. With the capability of storing only 2? < 2” keys, we may modify this algorithm and find 
all possible keys in O(2?"~”) steps. 


Another example is given in [KSW9O], where triple EDE encryption with three different keys is 
considered. Let K = (ka, ky, ke) and K’ = (ka O A, ky, ke) be two secret keys, where A is a known 
constant and © denotes XOR. Suppose that we ate given a ciphertext C and the corresponding 
decryptions P and P’ of C with the keys K and K’, respectively. Since P’ = Dp, ga (Ek, (P)), we 
can determine ka (or all possible candidates for ka) in O(2”) steps, where n is the key size. Using an 
attack similar to the one described above, we may determine the rest of the key (that is, kẹ and ke) in 
another O(2”) steps. 


Attacks on two-key triple-DES have been proposed by Merkle and Hellman [MH8T] and Van 
Oorschot and Wiener [VWI], but the data requirements of these attacks make them impractical. 
Further information on triple-DES can be obtained from various sources [Bih95] [KR99]. 


3.2.7 Whatis DESX? 


DESX is a strengthened variant of DES supported by RSA Security’s toolkits (see Question 6.2.3). 
The difference between DES and DESX is that, in DESX, the input plaintext is bitwise XORed 
with 64 bits of additional key material before encryption with DES and the output is also bitwise 
XORed with another 64 bits of key material. The security of DESX against differential and linear 
attack (see Question 2.4.5) appears to be equivalent to that of DES with independent subkeys (see 
Question so there is not a great increase in security with regards to these attacks. However the 
main motivation for DESX was in providing a computationally simple way to dramatically improve 
on the resistance of DES to exhaustive key search attacks. This improved security was demonstrated 
in a formal manner by Killian and Rogaway [RK9G] and Rogaway [[Rog96]. The DESX construction 
is due to Rivest. 


93 


94 


Frequently Asked Questions About Today's Cryptography / Chapter 3 


3.2.8 What are some other DES variants? 


G-DES is a variant on DES devised by Schaumuller-Bichl to improve on the performance of DES 
by defining a cipher based on DES with a larger block size, but without an increase in the amount of 
computation required [Sch83]. It was claimed that G-DES was as secure as DES since the cipher was 
based on DES. However, Biham and Shamir showed that G-DES with the recommended parameter 
sizes is easily broken and that any alterations of G-DES parameters that result in a cipher faster than 
DES are less secure than DES [BS93b]. 


Another variant of DES uses independent subkeys. The DES algorithm detives sixteen 48-bit 
subkeys, for use in each of the 16 rounds, from the 56-bit secret key supplied by the user. It is 
interesting to consider the effect of using a 768-bit key (divided into 16 48-bit subkeys) in place of 
the 16 related 48-bit keys that are generated by the key schedule in the DES algorithm. 


While the use of independent subkeys would obviously vastly increase the effort required for 
exhaustive key search, such a change to the cipher would make it only moderately more secure 
against differential and linear cryptanalytic attack (see Question 2.4.5) than ordinary DES. Biham 
estimated that 2°! chosen plaintexts are required for a differential attack on DES with independent 
subkeys, while 2% known plaintexts are required for linear cryptanalysis [Bih95]. 


3.3 AES 
3.3.1 Whatis the AES? 


The AES is the Advanced Encryption Standard. The AES is intended to be issued as a FIPS 
(see Question 6.2.1) standard and will replace DES. Most now agree that this venerable cipher is 
approaching the end of its useful life; DES has not been reaffirmed as a federal standard. In January 
1997 the AES initiative was announced and in September 1997 the public was invited to propose 
suitable block ciphers as candidates for the AES. NIST is looking for a cipher that will remain secure 
well into the next century. 
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3.3.2 Whatare some candidates for the AES? 


There is considerable interest in the AES initiative and 15 candidates were accepted for consideration 
in the first round. Among these were close variants of some of the more popular and trusted 
algorithms currently available, such as RC5 (see Question B.6.4), CAST, and SAFER-SK (see 
Question B.6.7). Other good candidates from well-respected cryptographers were also submitted. 
One of the reasons for close variants being proposed rather than the original ciphers is that one of 
the criteria for the AES submission is the ability to support 128-bit blocks of plaintext. Most ciphers 
were developed with an eye to providing a drop-in replacement for DES and, as a result, were often 
limited to having a 64-bit block size. 


Among the fifteen candidates, five candidates have qualified for a second round. Here is a short 
presentation of the five candidates. 


MARS Submitted by IBM (United States). As opposed to its competitors, IBM has constructed 
an AES candidate that is novel in its design. MARS accepts key sizes up to 448 bits and consists 
of 16 rounds -- the cryptographic core -- wrapped with two 8-round mixing layers. The purpose of 
the mixing rounds is to obtain diffusion, while the cryptographic core is designed to resist against 
all well-known attacks. Basic components in the rounds are common operations such as integer and 
bitwise addition and rotation. Its performance is good or excellent on most platforms with some 
reservations concerning smart card implementations. MARS differs from the other AES finalists 
in that it is not based on a well-reputed algorithm that has been around for several years. Due to 
this fact and the alleged complexity of the algorithm, the security of MARS has been claimed to be 
difficult to estimate. 


RC6 Submitted by RSA Laboratories (United States). RC6 is a parameterized, fast and simple 
algorithm based on the well-trusted RC5 cipher. The AES submission consists of 20 rounds, which 
has been claimed to be a bit low; however, no security gaps have been discovered thus far. The 
algorithm might be less suitable on certain platforms due to its use of 32-bit variable rotations 
and integer multiplications, but when such operations are supported, RC6 is faster than any other 
candidate. RC6 is described in more detail in the answer to Question B.6.4. 


Rijndael Submitted by Joan Daemen and Vincent Rijmen (Belgium). Rijndael is based on the 
algorithm Square and received excellent reviews from NIST in the Round 1 status report -- the 
algorithm is fast, simple, secure, versatile, and well-suited for smart card implementations. For the 
moment, Rijndael appears to have no major disadvantages in comparison with the other candidates. 
Rijndael is unconventional in that its blocks are matrices of elements in GF(25) (see Appendix [A), 
that is, arrays of bytes. In the 128-bit version, Rijndael consists of ten rounds, and in each round the 
individual bytes are transformed, the rows are rotated, and the columns are multiplied to a constant 
matrix. Each round is concluded with an XORing of the resulting array to a round key. 


Serpent Submitted by Ross Anderson (United Kingdom), Eli Biham (Israel), and Lars Knudsen 
(Norway). The keywords for Serpent are conservatism and security rather than novelty and speed; 
the algorithm contains eight S-boxes based on the S-boxes in DES, and the 32 rounds ate arguably 
twice as many as needed to meet the AES security requirements. This makes Serpent easy to trust, 
but the price the algorithm has to pay is a weaker performance compared to the other AES finalists. 


However, due to small memory requirements, Serpent is well-suited for smart card implementations. 
This property helped Serpent knocking out CAST-256 (see Question B.6.7), which is similar in 


performance and security. 


Twofish Submitted by Bruce Schneier, John Kelsey, Doug Whiting, David Wagner, Chris Hall, 
and Niels Ferguson (United States). Twofish is based on Schneier's algorithm Blowfish (see 
Question B.6.7). Twofish is a fast and versatile Feistel network that does not require much memory. 
Yet, the structure of the cipher is very complex and hence difficult to analyze. This makes Twofish 
similar to MARS, but Twofish has the advantage of being based on an already well-studied and 
well-trusted algorithm. 


97 


98 


Frequently Asked Questions About Today's Cryptography / Chapter 3 


3.3.3 Whatis the schedule for the AES? 


It would be surprising if the process for choosing something as important as a block cipher standard 
for the next 20-30 years (which is the intended lifetime of the AES) were not long and involved. 
June 15, 1998 was the last day to submit an algorithm. Following that, there was a period of review 
before five candidates (see the previous question) was chosen for further, more involved scrutiny. 
From these five, it is intended that the AES will be chosen. It is anticipated that the process will be 
completed by the year 2001. 


The current status of the process is that NIST has encouraged the public to provide comments on 
certain AES related issues: 


1. How many AES algorithms? There might be reasons for choosing multiple algorithms instead of 
just a single one. 


2. What about the speed versus security margin tradeoff? The margin tradeoff is the number of rounds 
minus the “security threshold” (the maximal number of rounds for which the algorithm is 
currently considered as insecure), and the question is how small a security margin should be 
allowed to be. 


3. How important are low-end smart cards and related environments when selecting the AES algorithm(s)? 
4. What is the relative importance of hardware vs. software performance in the selection of the AES algorithm(s)? 


5. What modes of operation should be available for the AES algorithm(s)? The current DES standards are 
ECB, CBC, CFB, and OFB; see Questions 


For more information, see the AES home page at 


http://www.nist.gov/aes/]. 


3.4 DSA 
3.4.1 What are DSA and DSS? 


The National Institute of Standards and Technology (NIST) (see Question (6.2.1) published the 
Digital Signature Algorithm (DSA) in the Digital Signature Standard (DSS), which is a part of the 
U.S. government’s Capstone project (see Question 6.2.3). DSS was selected by NIST, in cooperation 
with the NSA (see Question 6.2.2), to be the digital authentication standard of the U.S. government. 
The standard was issued in May 1994. 


DSA is based on the discrete logarithm problem (see Question 2.3.7) and is related to signature 
schemes that were proposed by Schnorr [Sch90] and ElGamal (see Question B.6.8). While the RSA 
system can be used for both encryption and digital signatures (see Question the DSA can only 
be used to provide digital signatures. For a detailed description of DSA, see [NIS945] or [NIS92}. 


In DSA, signature generation is faster than signature verification, whereas with the RSA algorithm, 
signature verification is very much faster than signature generation (if the public and private 
exponents, respectively, are chosen for this property, which is the usual case). It might be claimed 
that it is advantageous for signing to be the faster operation, but since in many applications a piece 
of digital information is signed once, but verified often, it may well be more advantageous to have 
faster verification. The tradeoffs and issues involved have been explored by Wiener [Wie98]. There 
has been work by many authors including Naccache et al. [NMR94] on developing techniques to 
improve the efficiency of DSA, both for signing and verification. 


Although several aspects of DSA have been criticized since its announcement, it is being incorporated 
into a number of systems and specifications. Initial criticism focused on a few main issues: it lacked 
the flexibility of the RSA cryptosystem; verification of signatures with DSA was too slow; the 
existence of a second authentication mechanism was likely to cause hardship to computer hardware 
and software vendors, who had already standardized on the RSA algorithm; and that the process 
by which NIST chose DSA was too secretive and arbitrary, with too much influence wielded by 
the NSA. Other criticisms more related to the security of the scheme were addressed by NIST by 
modifying the original proposal. A more detailed discussion of the various criticisms can be found 


in [NIS92], and a detailed response by NIST can be found in [SBY3]. 
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3.4.2 Is DSA secure? 


The Digital Signature Standard (see Question B.4.T) was originally proposed by NIST with a fixed 
512-bit key size. After much criticism that this is not secure enough, especially for long-term security, 
NIST revised DSS to allow key sizes up to 1024 bits. In fact, even larger key sizes are now allowed 
in ANSI X9.30 [[ANS97]. DSA is, at present, considered to be secure with 1024-bit keys. 


DSA makes use of computation of discrete logarithms in certain subgroups in the finite field GF (p) 
for some prime p. The problem was first proposed for cryptographic use in 1989 by Schnorr [Sch90}. 
No efficient attacks have yet been reported on this form of the discrete logarithm problem. 


Some researchers watned about the existence of “trapdoor” primes in DSA, which could enable 
a key to be easily broken. These trapdoor primes are relatively rare and easily avoided if proper 
key-generation procedures are followed [SB93]. 


3.5 ELLIPTIC CURVE CRYPTOSYSTEMS 
3.5.1 What are elliptic curve cryptosystems? 


Elliptic curve cryptosystems were first proposed independently by Victor Miller [MiI8G] and Neal 
Koblitz [Kob87] in the mid-1980s. At a high level, they ate analogs of existing public-key 
ctyptosystems in which modular arithmetic is replaced by operations defined over elliptic curves 
(see Question 2.3.10). The elliptic curve cryptosystems that have appeared in the literature can be 
classified into two categories according to whether they are analogs to the RSA system or to discrete 
logarithm based systems. 


Just as in all public-key cryptosystems, the security of elliptic curve cryptosystems relies on the 
underlying hard mathematical problems (see Section 2.3). It turns out that elliptic curve analogs 
of the RSA system are mainly of academic interest and offer no practical advantage over the RSA 
system, since their security is based on the same underlying problem, namely integer factorization. 
The situation is quite different with elliptic curve variants of discrete logarithm based systems (see 
Question 2.3.7). The security of such systems depends on the following hard problem: Given two 
points G and Y on an elliptic curve such that Y = kG (that is, Y is G added to itself k times), find 
the integer k. This problem is commonly referred to as the eliptic curve discrete logarithm problem. 


Presently, the methods for computing general elliptic curve discrete logarithms are much less efficient 
than those for factoring or computing conventional discrete logarithms. As a result, shorter key sizes 
can be used to achieve the same security of conventional public-key cryptosystems, which might 
lead to better memory requirements and improved performance. One can easily construct elliptic 
curve encryption, signature, and key agreement schemes by making analogs of ElGamal, DSA, and 
Diffie-Hellman. These variants appear to offer certain implementation advantages over the original 
schemes, and they have recently drawn more and more attention from both the academic community 
and the industry. 


For more information on elliptic curve cryptosystems, see the survey article [RY97] by Robshaw and 
Yin or the CryptoBytes article [Men95] by Alfred Menezes. 
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3.5.2 Are elliptic curve cryptosystems secure? 


In general, the best attacks on the elliptic curve discrete logarithm problems have been general 
brute-force methods. The current lack of more specific attacks means that shorter key sizes for 
elliptic cryptosystems appear to give similar security as much larger keys that might be used in 
cryptosystems based on the discrete logarithm problem and integer factorization. For certain choices 
of elliptic curves there do exist more efficient attacks. Menezes, Okamoto, and Vanstone [MOV90] 
have been able to reduce the elliptic curve discrete logarithm problem to the traditional discrete 
logarithm problem for certain curves, thereby necessitating the same size keys as is used in more 
traditional public-key systems. However these cases are readily classified and easily avoided. 


In 1997, elliptic curve cryptography began to receive a lot more attention; by the end of 1999, there 
were no major developments as to the security of these cryptosystems. The longer this situation 
continues, the more confidence will grow that they really do offer as much security as currently 
appears. However, a sizeable group of very respected researchers have some doubts as to whether this 
situation will remain unchanged for many years. In particular, there is some evidence that the use of 
special elliptic curves, sometimes known as Koblitz curves, which provide very fast implementations, 
might allow new specialized attacks. As a starting point, the basic brute-force attacks can be improved 
when attacking these curves [MWie98]. While RSA Laboratories believes that continued research into 
elliptic curve cryptosystems might eventually create the same level of wide-spread trust as is enjoyed 
by other public-key techniques (provided there are no upsets), the use of special purpose curves will 
most likely always be viewed with extreme skepticism. 


3.5.3 Are elliptic curve cryptosystems widely used? 


Elliptic curve cryptosystems have emerged as a promising new area in public-key cryptography in 
recent years due to their potential for offering similar security to established public-key cryptosystems 
with reduced key sizes. Improvements in various aspects of implementation, including the generation 
of elliptic curves, have made elliptic curve cryptography more practical than when it was first 
introduced in the mid 80's. 


Elliptic curve cryptosystems are especially useful in applications for which memory, bandwidth, or 
computational power is limited. It is expected that the use of elliptic curve cryptosystems in these 
special areas will continue to grow in the future. 


Standards efforts for elliptic curve cryptography are well underway. X9.F.1, an ANSLaccredited 
standards committee for the financial services industry is developing two standards: ANSI X9.62 for 
digital signatures and ANSI X9.63 for key agreement and key transport. IEEE P1363 is working on 
a general reference for public-key techniques from several families, including elliptic curves. 


Recently, NIST recommended a certain set of elliptic curves for government use. This set of curves 
can be divided into two classes: curves over a prime field GF(p) and curves over a binary field 
GF(2™). The curves over GF (p) are of the form 


y = r? — 3r +b 
with b random, while the curves over GF (2°) are either of the form 
yY? + ry =r? +r? +b 
with b random or Kob/itz curves. A Koblitz curve has the form 
y? + zy = r? +az? +1 


with a =0 or 1. 


103 


104 


Frequently Asked Questions About Today's Cryptography / Chapter 3 


3.5.4 How do elliptic curve cryptosystems compare with other cryptosystems? 


The main attraction of elliptic curve cryptosystems over other public-key cryptosystems is the fact 
that they ate based on a different, hard problem. This may lead to smaller key sizes and better 
performance in certain public key operations for the same level of security. 


Very roughly speaking, when this FAQ was published elliptic curve cryptosystems with a 160-bit key 
offer the same security of the RSA system and discrete logarithm based systems with a 1024-bit key. 
As a result, the length of the public key and private key is much shorter in elliptic curve cryptosystems. 
In terms of speed, however, it is quite difficult to give a quantitative comparison, partly because of 
the various optimization techniques one can apply to different systems. It is perhaps fair to say the 
following: Elliptic curve cryptosystems are faster than the corresponding discrete logarithm based 
systems. Elliptic curve cryptosystems are faster than the RSA system in signing and decryption, but 
slower in signature verification and encryption. For more detailed comparisons, see the survey article 


[RY97] by Robshaw and Yin. 


With academic advances in attacking different hard mathematical problems both the security estimates 
for various key sizes in different systems and the performance comparisons between systems are 
likely to change. 


3.5.5 Whatis the Certicom ECC Challenge? 


The Certicom ECC Challenge is the elliptic curve counterpart to the RSA Factoring Challenge (see 
Question 2.3.0). The challenge is to find the private key in an elliptic curve cryptosystem given the 
public key and associated parameters. Mathematically, the challenge is to solve a discrete logarithm 
problem (see Question 2.3.7) in an elliptic curve group (see Question 2.3.10). The competitors may 
choose whether they want to attack a key over GF (2) or over GF (p), where pis a given prime. 


The challenge is divided into three levels. “Level 0” consists of several Exercises with keys consisting 
of 79, 89, and, 97 bits. The key sizes in Level I are 109 and 131, while the key sizes in Level II are 
163, 191, 239, and 359. 


Among the Level I and II challenges, the 109-bit challenges are provably feasible given today’s 
computational power, while the 131-bit challenges seem to be infeasible for the moment; in 
commercial applications, 132 is the minimal key size recommended by Lenstra and Verheul (see 


Question #.1.2.1). 

The Exercise Level was completed in September 1999, when a group led by Robert Harley at INRIA 
in France managed to solve ECC2-97, a challenge concerning an elliptic curve over GF(2%). In 
April 2000, the first Level I challenge (the 109-bit key ECC2K-108) was solved by Harley’s team 
after four months of computations on 9,500 machines. The required computational power has been 


estimated to be about 50 times that required to factor RSA-155 (see Question 2.3.0). 


For more information on the Certicom ECC Challenge, see 


http: //www.certicom.con 
and Harley’s home page at 


http://pauillac.inria.fr/ xharley/ecdl/|. 
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3.6 OTHER CRYPTOGRAPHIC TECHNIQUES 
3.6.1 Whatis Diffie-Hellman? 


The Diffie-Hellman key agreement protocol (also called exponential key agreement) was developed 
by Diffie and Hellman [DH74] in 1976 and published in the ground-breaking paper “New Directions 
in Cryptography.” The protocol allows two users to exchange a secret key over an insecure medium 
without any prior secrets. 


The protocol has two system parameters p and g. They are both public and may be used by all the 
users in a system. Parameter p is a prime number and parameter g (usually called a generator) is an 
integer less than p, with the following property: for every number n between 1 and p— 1 inclusive, 
there is a power k of g such that n = g* mod p. 


Suppose Alice and Bob want to agree on a shared secret key using the Diffie-Hellman key agreement 
protocol. They proceed as follows: First, Alice generates a random private value a and Bob generates 
a random private value b. Both a and b ate drawn from the set of integers {1,...,p — 2}. Then they 
derive their public values using parameters p and g and their private values. Alice’s public value is 
g“ mod p and Bob's public value is g? mod p. They then exchange their public values. Finally, Alice 
computes g® = (g°)* mod p, and Bob computes g”* = (9%)? mod p. Since g% = g”* = k, Alice and 
Bob now have a shared secret key k. 


The protocol depends on the discrete logarithm problem for its security. It assumes that it is 
computationally infeasible to calculate the shared secret key k = g% mod p given the two public 
values g“ mod p and g? mod p when the prime p is sufficiently large. Maurer [Mau94] has shown that 
breaking the Diffie-Hellman protocol is equivalent to computing discrete logarithms under certain 


assumptions. 


The Diffie-Hellman key exchange is vulnerable to a man-in-the-middle attack. In this attack, an 
opponent Carol intercepts Alice’s public value and sends her own public value to Bob. When Bob 
transmits his public value, Carol substitutes it with her own and sends it to Alice. Carol and Alice 
thus agree on one shared key and Carol and Bob agree on another shared key. After this exchange, 
Carol simply decrypts any messages sent out by Alice or Bob, and then reads and possibly modifies 
them before re-encrypting with the appropriate key and transmitting them to the other party. This 
vulnerability is present because Diffie-Hellman key exchange does not authenticate the participants. 
Possible solutions include the use of digital signatures and other protocol variants. 


The authenticated Diffie-Hellman key agreement protocol, or Station-to-Station (STS) protocol, was 
developed by Diffie, van Oorschot, and Wiener in 1992 [DVWJI2] to defeat the man-in-the-middle 
attack on the Diffie-Hellman key agreement protocol. The immunity is achieved by allowing the two 
parties to authenticate themselves to each other by the use of digital signatures (see Question 
and public-key certificates (see Question H.1.3.10). 


Roughly speaking, the basic idea is as follows. Prior to execution of the protocol, the two parties 
Alice and Bob each obtain a public/private key pair and a certificate for the public key. During 
the protocol, Alice computes a signature on certain messages, covering the public value g* mod p. 


Bob proceeds in a similar way. Even though Carol is still able to intercept messages between Alice 
and Bob, she cannot forge signatures without Alice’s private key and Bob’s private key. Hence, the 
enhanced protocol defeats the man-in-the-middle attack. 


In recent years, the original Diffie-Hellman protocol has been understood to be an example of a 
much more general cryptographic technique, the common element being the derivation of a shared 
sectet value (that is, key) from one party’s public key and another party’s private key. The parties’ 
key pairs may be generated anew at each run of the protocol, as in the original Diffie-Hellman 
protocol. The public keys may be certified, so that the parties can be authenticated and there may 
be a combination of these attributes. The draft ANSI X9.42 (see Question 6.3.1) illustrates some 
of these combinations, and a recent paper by Blake-Wilson, Johnson, and Menezes provides some 
relevant security proofs. 
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3.6.2 Whatis RC2? 


RC2 is a variable key-size block cipher designed by Ronald Rivest for RSA Data Security (now RSA 
Security). “RC” stands for “Ron's Code” or “Rivest’s Cipher.” It is faster than DES and is designed 
as a “drop-in” replacement for DES (see Section B.2). It can be made more secure or less secure 
than DES against exhaustive key search by using appropriate key sizes. It has a block size of 64 bits 
and is about two to three times faster than DES in software. An additional string (40 to 88 bits long) 
called a salt can be used to thwart attackers who try to precompute a large look-up table of possible 
encryptions. The salt is appended to the encryption key, and this lengthened key is used to encrypt 
the message. The salt is then sent, unencrypted, with the message. RC2 and RC4 have been widely 
used by developers who want to export their products; more stringent conditions have been applied 
to DES exports. 


An agreement between the Software Publishers Association (SPA) and the United States government 
has been given RC2 and RC4 (see Question B.6.3) special status by means of which the export 
approval process has been simpler and quicker than the usual cryptographic export process. Due 
to dramatically relaxed restrictions on export regulations as of January 2000, the greater part of this 
agreement will probably no longer be needed. 


3.6.3 Whatis RC4? 


RC4 is a stream cipher designed by Rivest for RSA Data Security (now RSA Security). It is a variable 
key-size stream cipher with byte-oriented operations. The algorithm is based on the use of a random 
permutation. Analysis shows that the period of the cipher is overwhelmingly likely to be greater 
than 101%. Eight to sixteen machine operations are required per output byte, and the cipher can be 
expected to run very quickly in software. Independent analysts have scrutinized the algorithm and it 
is considered secute. 


RCA is used for file encryption in products such as RSA SecurPC (see Question 6.2.4). It is also used 
for secure communications, as in the encryption of traffic to and from secure web sites using the 
SSL protocol (see Question 6.1.2). 
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3.6.4 What are RC5 and RC6? 


RC5 [Riv95] is a fast block cipher designed by Ronald Rivest for RSA Data Security (now RSA 
Security) in 1994. It is a parameterized algorithm with a variable block size, a variable key size, and 
a variable number of rounds. Allowable choices for the block size are 32 bits (for experimentation 
and evaluation purposes only), 64 bits (for use a drop-in replacement for DES), and 128 bits. The 
number of rounds can range from 0 to 255, while the key can range from 0 bits to 2040 bits in size. 
Such built-in variability provides flexibility at all levels of security and efficiency. 


There are three routines in RC5: key expansion, encryption, and decryption. In the key-expansion 
routine, the user-provided secret key is expanded to fill a key table whose size depends on the 
number of rounds. The key table is then used in both encryption and decryption. The encryption 
routine consists of three primitive operations: integer addition, bitwise XOR, and variable rotation. 
The exceptional simplicity of RC5 makes it easy to implement and analyze. Indeed, like the RSA 
system, the encryption steps of RC5 can be written on the “back of an envelope”. 


The heavy use of data-dependent rotations and the mixture of different operations provide the 
security of RC5. In particular, the use of data-dependent rotations helps defeat differential and linear 


cryptanalysis (see Question 2.4.5). 


In the five years since RC5 was proposed, there have been numerous studies of RC5’s security 


[KY95] [IKM9G] [BK98] [Sel98]. Each study has provided a greater understanding of how RC5’s 


structure and components contribute to its security. For a summary of known cryptanalytic results, 
see the survey article [Yin97]. 


RC6 is a block cipher based on RC5 and designed by Rivest, Sidney, and Yin for RSA Security. Like 
RC5, RC6 is a parameterized algorithm where the block size, the key size, and the number of rounds 
are variable; again, the upper limit on the key size is 2040 bits. The main goal for the inventors has 
been to meet the requirements of the AES (see Section B.3). Indeed, RC6 is among the five finalists 


(see Question B.3.2). 


There are two main new features in RC6 compared to RC5: the inclusion of integer multiplication 
and the use of four b/4-bit working registers instead of two b/2-bit registers as in RC5 (b is the 
block size). Integer multiplication is used to increase the diffusion achieved per round so that fewer 
rounds are needed and the speed of the cipher can be increased. The reason for using four working 
registers instead of two is technical rather than theoretical. Namely, the default block size of the AES 
is 128 bits; while RC5 deals with 64-bit operations when using this block size, 32-bit operations are 
preferable given the intended architecture of the AES. 


The U.S. patent office granted the RC5 patent to RSA Data Security (now RSA Security) in May 
1997. RC6 is proprietary of RSA Security but can be freely used for research and evaluation purposes 
during the AES evaluation period. We emphasize that ¿f RC6 is selected for the AES, RSA Security 
will not require any licensing or royalty payments for products using the algorithm; there will be no 
restrictions beyond those specified for the AES by the U.S. government. However, RC6 may remain 
a trademark of RSA Security. 


3.6.5 What are SHA and SHA-1? 


The Secure Hash Algorithm (SHA), the algorithm specified in the Secure Hash Standard (SHS, FIPS 
180), was developed by NIST (see Question (6.2.1) [INIS93a]. SHA-1 [NIS94d] is a revision to SHA 
that was published in 1994; the revision corrected an unpublished flaw in SHA. Its design is vety 
similar to the MD4 family of hash functions developed by Rivest (see Question B.6.6). SHA-1 is also 
described in the ANSI X9.30 (part 2) standard. 


The algorithm takes a message of less than 2° bits in length and produces a 160-bit message digest. 
The algorithm is slightly slower than MD5 (see Question B.6.0), but the larger message digest makes 
it more secure against brute-force collision and inversion attacks (see Question 2.1.0). SHA is part 
of the Capstone project (see Question 6.2.3). For further information on SHA, see [Pre93] and 


[Rob95H. 
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3.6.6 What are MD2, MD4, and MD5? 


MD2 [Ka192], MD4 [Riv9Tb] [Riv92b], and MD5 [Riv92d] are message-digest algorithms developed 


by Rivest. They ate meant for digital signature applications where a large message has to be 
“compressed” in a secure manner before being signed with the private key. All three algorithms take 
a message of arbitrary length and produce a 128-bit message digest. While the structures of these 
algorithms are somewhat similar, the design of MD2 is quite different from that of MD4 and MD. 
MD2 was optimized for 8-bit machines, whereas MD4 and MD5 were aimed at 32-bit machines. 
Description and source code for the three algorithms can be found as Internet RFCs 1319-1321 


[K292] [Riv22b] [Riv22d. 


MD2 was developed by Rivest in 1989. The message is first padded so its length in bytes is divisible 
by 16. A 16-byte checksum is then appended to the message, and the hash value is computed on the 
resulting message. Rogier and Chauvaud have found that collisions for MD2 can be constructed if 
the calculation of the checksum is omitted [RC95]. This is the only cryptanalytic result known for 
MD2. 


MD4 was developed by Rivest in 1990. The message is padded to ensure that its length in bits plus 
64 is divisible by 512. A 64-bit binary representation of the original length of the message is then 
concatenated to the message. The message is processed in 512-bit blocks in the Damgard/Merkle 
iterative structure (see Question 2.T.6), and each block is processed in three distinct rounds. Attacks 
on versions of MD4 with either the first or the last rounds missing were developed very quickly by 
Den Boer, Bosselaers [IDB92] and others. Dobbertin [Dob95] has shown how collisions for the full 
version of MD4 can be found in under a minute on a typical PC. In recent work, Dobbertin (Fast 
Software Encryption, 1998) has shown that a reduced version of MD4 in which the third round 
of the compression function is not executed but everything else remains the same, is not one-way. 
Clearly, MD4 should now be considered broken. 


MD5 was developed by Rivest in 1991. It is basically MD4 with “safety-belts” and while it is slightly 
slower than MD4, it is more secure. The algorithm consists of four distinct rounds, which has a 
slightly different design from that of MD4. Message-digest size, as well as padding requirements, 
remain the same. Den Boer and Bosselaers [DB94] have found pseudo-collisions for MD5 (see 
Question 2.1.6). More recent work by Dobbertin has extended the techniques used so effectively 
in the analysis of MD4 to find collisions for the compression function of MD5 [DB9Gb]. While 
stopping short of providing collisions for the hash function in its entirety this is clearly a significant 
step. For a comparison of these different techniques and their impact the reader is referred to 


[Rob9G). 


Van Oorschot and Wiener [VW/94] have considered a brute-force search for collisions (see Ques- 
tion 2.1.0) in hash functions, and they estimate a collision search machine designed specifically for 
MD5 (costing $10 million in 1994) could find a collision for MD5 in 24 days on average. The general 
techniques can be applied to other hash functions. 


More details on MD2, MD4, and MD5 can be found in [Pre93] and [Rob95h]. 


3.6.7 Whatare some other block ciphers? 


Many of the block ciphers proposed in recent years, including those listed below, were developed (at 
least in part) either as successors to DES or as candidates for the Advanced Encryption Standard, 
AES. See Sections B.J and P.J for more information on DES and AES, respectively. For descriptions 
of the five finalists to the AES (MARS, Rijndael, RC6, Serpent, and Twofish), see Question B.3.2. 


IDEA (International Data Encryption Algorithm) [LMM] is the second version of a block cipher 
designed and presented by Lai and Massey [LM9T]. It is a 64-bit iterative block cipher with a 128-bit 
key. The encryption process requires eight complex rounds. While the cipher does not have a Feistel 
structure (see Question 2.1.4), decryption is carried out in the same manner as encryption once the 
decryption subkeys have been calculated from the encryption subkeys. The cipher structure was 
designed to be easily implemented in both software and hardware, and the security of IDEA relies 
on the use of three incompatible types of arithmetic operations on 16-bit words. However some of 
the arithmetic operations used in IDEA are not that fast in software. As a result the speed of IDEA 
in software is similar to that of DES. 


One of the principles used during the design of IDEA was to facilitate analysis of its strength against 
differential cryptanalysis (see Question 2.4.5) and IDEA is considered to be immune to differential 
cryptanalysis. Furthermore there are no linear cryptanalytic attacks on IDEA and there are no known 
algebraic weaknesses in IDEA. The most significant cryptanalytic result is due to Daemen [DGV94], 
who discovered a large class of 2%! weak keys (see Question for which the use of such a key 
during encryption could be detected easily and the key recovered. However, since there are 2128 
possible keys, this result has no impact on the practical security of the cipher for encryption provided 
the encryption keys are chosen at random. IDEA is generally considered to be a very secure cipher 
and both the cipher development and its theoretical basis have been openly and widely discussed. 


SAFER (Secure And Fast Encryption Routine) is a non-proprietary block cipher developed by 
Massey in 1993 for Cylink Corporation [Mas93]. It is a byte-oriented algorithm with a 64-bit block 
size and, in one version, a 64-bit key size. It has a variable number of rounds, but a minimum of six 
rounds is recommended. Unlike most recent block ciphers, SAFER has slightly different encryption 
and decryption procedures. Only byte-based operations are employed to ensure its utility in smart 
card-based applications that have limited processing power. When first announced, SAFER was 
intended to be implemented with a key of length 64 bits and it was accordingly named SAFER K-64. 
Another version of SAFER was designed that could handle 128-bit keys and was named SAFER 
K-128. A variant -- SAFER+ -- was submitted to the AES, but the algorithm did not qualify for the 
second round, due to its lack of speed. 


Early cryptanalysis of SAFER K-64 [Mas93] showed that SAFER K-64 could be considered immune 
to both differential and linear cryptanalysis (see Question 2.4.5) when the number of rounds is 
greater than six. However, Knudsen [Knu95] discovered a weakness in the key schedule of SAFER 
K-64 and a new key schedule for the family of SAFER ciphers soon followed. These new versions 
of SAFER are denoted SAFER SK-64 and SAFER SK-128 where SK denotes a strengthened key 
schedule (though one joke has it that SK really stands for “Stop Knudsen”, a wise precaution in 
the design of any block cipher). Most recently, a version of SAFER called SAFER SK-40 was 
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announced, which uses a 40-bit key and has five rounds (thereby increasing the speed of encryption). 
This reduced-round version is secure against differential and linear cryptanalysis in the sense that any 
such attack would require more effort than a brute-force search for a 40-bit key. 


The Fast Data Encipherment Algorithm (FEAL) was presented by Shimizu and Miyaguchi [SM88] 
as an alternative to DES. The original cipher (called FEAL-4) was a four-round cryptosystem with 
a 64-bit block size and a 64-bit key size and it was designed to give high performance in software. 
Soon a variety of attacks against FEAL-4 were announced including one attack that required only 
20 chosen plaintexts [Mur90]. Several results in the cryptanalysis of FEAL-8 (eight-round version) 
led the designers to introduce a revised version, FEAL-N, where N denoted the number of rounds. 
Biham and Shamir [BS91b] developed differential cryptanalytic attacks against FEAL-N for up to 31 
rounds. In 1994, Ohta and Aoki presented a linear cryptanalytic attack against FEAL-8 that required 
22 known plaintexts [OA94], and other improvements [KR95a] followed. In the wake of these 
numerous attacks, FEAL and its derivatives should be considered insecure. 


Skipjack is the encryption algorithm contained in the Clipper chip (see Question (6.2.4), designed by 
the NSA (see Question 6.2.2). It uses an 80-bit key to encrypt 64-bit blocks of data. Skipjack is 
expected to be more secure than DES in the absence of any analytic attack since it uses 80-bit keys. 
By contrast, DES uses 56-bit keys. 


Initially, the details of Skipjack were classified and the decision not to make the details of the 
algorithm publicly available was widely criticized. Some people were suspicious that Skipjack might 
not be secure, either due to an oversight by its designers, or by the deliberate introduction of a secret 
trapdoor. Since Skipjack was not public, it could not be widely scrutinized and there was little public 
confidence in the cipher. 


Aware of such criticism, the government invited a small group of independent cryptographers to 
examine the Skipjack algorithm. They issued a report [BDKY3] that stated that although their study 
was too limited to reach a definitive conclusion, they nevertheless believed Skipjack was secute. 


In June 1998 Skipjack was declassified by the NSA. Early cryptanalysis has failed to find any 
substantial weakness in the cipher. 


Blowfish is a 64-bit block cipher developed by Bruce Schneier [Sch93]. It is a Feistel cipher 
(see Question 2.1.4) and each round consists of a key-dependent permutation and a key-and-data- 
dependent substitution. All operations are based on XORs and additions on 32-bit words. The key 
has a variable length (with a maximum length of 448 bits) and is used to generate several subkey 
arrays. This cipher was designed specifically for 32-bit machines and is significantly faster than DES. 
There was an open competition for the cryptanalysis of Blowfish supported by Dr. Dobb’s Journal 
with a $1000 prize. This contest ended in April 1995 [Sch95]; among the results were the discoveries 
of existence of certain weak keys (see Question 2.4.5), an attack against a three-round version of 
Blowfish, and a differential attack against certain variants of Blowfish. However, Blowfish can still 
be considered secure, and Schneier has invited cryptanalysts to continue investigating his cipher. The 
AES candidate Twofish is based on Blowfish. 


CAST-128 is another popular 64-bit Feistel cipher allowing key sizes up to 128 bits. The name 
CAST stands for Carlisle Adams and Stafford Tavares, the original inventors of CAST. CAST-128 
consists of 16 non-identical rounds, where each round is built up by simple operations such as integer 
and bitwise addition and rotation. CAST-128 is owned by Entrust Technologies but is free for 
commercial as well as non-commercial use. The algorithm has been widely adopted by the internet 
community and is part of products from Pretty Good Privacy, IBM, and Microsoft. CAST-256 is a 
freely available extension of CAST-128 accepting up to 256 bits of key size and with a 128-bit block 
size. CAST-256 was one of the original candidates for the AES. Though no security weaknesses were 
found, the algorithm did not qualify for the second round. CAST-256 and the finalist Serpent share 
the property of strongly favoring security over speed, and since it is considered as unlikely that two 
“slow” algorithms would be selected for the AES, only one of them qualified for the second round. 
We emphasize that CAST-256 can be considered as secure. 
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3.6.8 What are some other public-key cryptosystems? 


The ElGamal system [Elg85] is a public-key cryptosystem based on the discrete logarithm problem. 
It consists of both encryption and signature variants. The encryption algorithm is similar in nature 
to the Diffie-Hellman key agreement protocol (see Question B.6.1). 


The system parameters for the ElGamal cryptosystem are a prime p and an integer g, whose powers 
modulo p generate a large number of elements (it is not necessary for g to be a generator of the 
group Z;; however, it is ideal). Alice has a private key a and a public key y, where y = g” mod p. 
Suppose Bob wishes to send a message m to Alice. Bob first generates a random number k less than 
p. He then computes 


yı = g" mod p and ya = y*m mod p. 


Bob sends (y1, y2) to Alice. Upon receiving the ciphertext, Alice computes y, “y2 mod p. This is 
equal to m, because 


y y2 =g “y'm=y y"m=m (mod p). 


The ElGamal signature algorithm is similar to the encryption algorithm in that the public key and 
private key have the same form. However, encryption is not the same as signature verification, nor 
is decryption the same as signature creation as in the RSA system (see Question B.1.1). DSA, The 
Digital Signature Algorithm (see Section B.4), is based in part on the ElGamal signature algorithm. 


Analysis based on the best available algorithms for both factoring and discrete logarithms show that 
the RSA system and the ElGamal system have similar security for equivalent key lengths. The main 
disadvantage of the ElGamal system is the need for randomness, and its slower speed (especially for 
signing). Another potential disadvantage of the ElGamal system is that message expansion by a factor 
of two takes place during encryption. However, such message expansion is generally unimportant if 
the cryptosystem is used only for exchange of secret keys. 


The Merkle-Hellman knapsack cryptosystem [MH78] is a public-key cryptosystem first published 
in 1978. It is commonly referred to as the knapsack cryptosystem. It is based on the subset sum 
problem in combinatorics. The problem involves selecting a number of objects with given weights 
from a large set such that the sum of the weights is equal to a pre-specified weight. This is considered 
to be a difficult problem to solve in general, but certain special cases of the problem are relatively 
easy to solve, which serve as the “trapdoor” of the system. Shamir broke the single iteration knapsack 
cryptosystem introduced in 1978 [Sha84]. Merkle then published the multiple-iteration knapsack 
problem broken by Brickell [Bri85]. Merkle offered from his own pocket a $100 reward for anybody 
able to crack the single iteration knapsack and a $1000 reward for anybody able to crack the multiple 
iteration cipher. When they were cracked, he promptly paid up. 


The Chor-Rivest knapsack cryptosystem was first published in 1984, followed by a revised version 
in 1988 [CR88]. It is the only knapsack-like cryptosystem that does not use modular multiplication. 
It was also the only knapsack-like cryptosystem that was secure for any extended period of time. 
Eventually, Schnorr and Horner [SH95] developed an attack on the Chor-Rivest cryptosystem 
using improved lattice reduction which reduced to hours the amount of time needed to crack the 
cryptosystem for certain parameter values (though not for those recommended by Chor and Rivest). 


They also showed how the attack could be extended to attack Damgard”s knapsack hash function 


[Dam90]. 


LUC is a public-key cryptosystem [SSY5] developed by a group of researchers in Australia and 
New Zealand. The cipher implements the analogs of the ElGamal system (see Question B.6.9), the 
Diffie-Hellman key agreement protocol (see Question B.6.1), and the RSA system (see Section B.1) 
over Lucas sequences. LUCELG is the Lucas sequence analog of ElGamal, while LUCDIF and 
LUCRSA are the Diffie-Hellman and RSA analogs, respectively. Lucas sequences used in the 
ctyptosystem are the general second-order linear recurrence relation defined by 


tn = Ptn—1 — qtn-2, 


where p and q are relatively prime integers. The encryption of the message is computed by iterating 
the recurrence, instead of by exponentiation as in the RSA system and the Diffie-Hellman protocol. 


A paper by Bleichenbacher et al. [BBL95] shows that many of the supposed security advantages 
of LUC over cryptosystems based on modular exponentiation are either not present, or not as 
substantial as claimed. 


The McEliece cryptosystem [Mce78] is a public-key encryption algorithm based on algebraic coding 
theory. The system uses a class of error-correcting codes known as the Goppa codes, for which fast 
decoding algorithms exist. The basic idea is to construct a Goppa code as the private key and disguise 
it as a general linear code, which is the public key. The general linear code cannot be easily decoded 
unless the corresponding private matrix is known. The McEliece cryptosystem has a number of 
drawbacks. These include large public key size (around half a megabit), substantial expansion of data, 
and possibly a certain similarity to the knapsack cryptosystem. Gabidulin, Paramonov, and Tretjakov 
[GPT91] proposed a modification of the McEliece cryptosystem by replacing Goppa codes with 
a different algebraic code and claimed the new version was more secure than the original system. 
However, Gibson [Gib93] later showed there was not really any advantage to the new version. 
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3.6.9 What are some other signature schemes? 


Merkle proposed a digital signature scheme based on both one-time signatures (see Question [7.7) and 
a hash function (see Question 2.1.0); this provides an infinite tree of one-time signatures [MerJ0b]. 


One-time signatures normally require the publishing of large amounts of data to authenticate many 
messages, since each signature can only be used once. Merkle’s scheme solves the problem by 
implementing the signatures via a tree-like scheme. Each message to be signed corresponds to a 
node in a tree, with each node consisting of the verification parameters used to sign a message and 
to authenticate the verification parameters of subsequent nodes. Although the number of messages 
that can be signed is limited by the size of the tree, the tree can be made arbitrarily large. Merkle’s 
signature scheme is fairly efficient, since it requires only the application of hash functions. 


The Rabin signature scheme [Rab79] is a variant of the RSA signature scheme (see Question B.T.1). 
It has the advantage over the RSA system that finding the private key and forgery are both provably 
as hard as factoring. Verification is faster than signing, as with RSA signatures. In Rabin’s scheme, 
the public key is an integer n where n = pq, and p and q are prime numbers which form the private 
key. The message to be signed must have a square root mod n; otherwise, it has to be modified 
slightly. Only about 1/4 of all possible messages have square roots mod n. The signature s of m is 
s = ml/? mod n. Thus to verify the signature, the receiver computes m = s? mod n. 


The signature is easy to compute if the prime factors of n are known, but provably difficult otherwise. 
Anyone who can consistently forge the signature for a modulus n can also factor n. The provable 
security has the side effect that the prime factors can be recovered under a chosen message attack. 
This attack can be countered by padding a given message with random bits or by modifying the 
message randomly, at the loss of provable security. See [GMR&86] for a discussion of a way to get 
around the paradox between provable security and resistance to chosen message attacks. 


3.6.10 What are some other stream ciphers? 


There are a number of alternative stream ciphers that have been proposed in cryptographic literature 
as well as a large number that appear in implementations and products world-wide. Many are based 
on the use of LFSRs (Linear Feedback Shift Registers; see Question 2.1.5), since such ciphers tend 
to be more amenable to analysis and it is easier to assess the security they offer. 


Rueppel suggests there are essentially four distinct approaches to stream cipher design [RueY2]. 
The first is termed the information-theoretic approach as exemplified by Shannon’s analysis of the 
one-time pad. The second approach is that of system-theoretic design. In essence, the cryptographer 
designs the cipher along established guidelines that ensure the cipher is resistant to all known attacks. 
While there is, of course, no substantial guarantee that future cryptanalysis will be unsuccessful, it is 
this design approach that is perhaps the most common in cipher design. The third approach is to 
attempt to relate the difficulty of breaking the stream cipher (where “breaking” means being able to 
predict the unseen keystream with a success rate better than can be achieved by guessing) to solving 
some difficult problem (see [BM84] [BBS88]). This complexity-theoretic approach is very appealing, 
but in practice the ciphers developed tend to be rather slow and impractical. The final approach 
highlighted by Rueppel is that of designing a randomized cipher. Here the aim is to ensure the 
cipher is resistant to any practical amount of cryptanalytic work, rather than being secure against an 
unlimited amount of work, as was the aim with Shannon’s information-theoretic approach. 


A recent example of a stream cipher designed by a system-theoretic approach is the Software- 
optimized Encryption Algorithm (SEAL), which was designed by Rogaway and Coppersmith in 1993 
[RC93] as a fast stream cipher for 32-bit machines. SEAL has a rather involved initialization phase 
during which a large set of tables is initialized using the Secure Hash Algorithm (see Question B.6.5). 
However, the use of look-up tables during keystream generation helps to achieve a very fast 
performance with just five instructions required per byte of output generated. 


A design that has system-theoretic as well as complexity-theoretic aspects is given by Aiello, 
Rajagopalan, and Venkatesan [ARV95]. The design, commonly referred to as “VRA,” derives a 
fast stream cipher from an arbitrary secure block cipher. VRA is described as a pseudo-random 
generator (see Question 2.5.2), not a stream cipher, but the two concepts are closely connected, since 
a pseudo-random generator can produce a (pseudo) one-time pad for encryption. 


For examples of ciphers in each of these categories, see Rueppel’s article [Rue92] or any book on 
contemporary cryptography. More details are also provided in an RSA Laboratories technical report 


[Rob95a]. 
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3.6.11 What other hash functions are there? 


The best review of hash function techniques is provided by Preneel [Pre¥3]. For a brief overview 
here, we note that hash functions are often divided into three classes: 


e Hash functions built around block ciphers. 
e Hash functions using modular arithmetic. 
e Hash functions with what is termed a “dedicated” design. 


By building a hash function around a block cipher, a designer aims to leverage the security of a 
well-trusted block cipher such as DES (see Section to obtain a well-trusted hash function. The 
so-called Davies-Meyer hash function [Pre¥3] is an example of a hash function built around the use 
of DES. 


The purpose of employing modular arithmetic in the second class of hash functions is to save on 
implementation costs. A hash function is generally used in conjunction with a digital signature 
algorithm which itself makes use of modular arithmetic. Unfortunately, the track record of such hash 
functions is not good from a security perspective and there are no hash functions in this second class 
that can be recommended for use today. 


The hash functions in the third class, with their so-called “dedicated” design, tend to be fast, 
achieving a considerable advantage over algorithms that are based around the use of a block cipher. 
MD4 is an early example of a popular hash function with such a design. Although MD4 is no longer 
considered secure for most cryptographic applications, most new dedicated hash functions make use 
of the same design principles as MD4 in a strengthened version. Their strength varies depending on 
the techniques, or combinations of techniques, employed in their design. Dedicated hash functions 
in current use include MD5 and SHA-1 (see Questions B.6.3] and B.6.6), as well as RIPEMD-160 


[DBP9G] and HAVAL [ZPS93]. 


3.6.12 What are some secret sharing schemes? 


Shamir's secret sharing scheme [Sha79] is a threshold scheme based on polynomial interpolation. To 
allow any m out of n people to construct a given secret, an (m — 1)-degree polynomial 


f(a) = ao + aie +--+ + amix”! 


over the finite field GF(q) is constructed such that the coefficient ap is the secret and all other 
coefficients are random elements in the field; the field is known to all participants. Each of the 
n shares is a pair (x;, yi) of numbers satisfying f(1,) = yi and x; # 0. Given any m shares, the 
polynomial is uniquely determined and hence the secret ay can be computed. However, given m — 1 
or fewer shares, the secret can be any element in the field. Therefore, Shamir’s scheme is a perfect 
secret sharing scheme (see Question 2.1.9). 
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Figure 3.1: Shamir’s secret sharing scheme. 


A special case where m = 2 (that is, two shares are required for retrieval of the secret) is given in 
Figure B.1, The polynomial is a line and the secret is the point where the line intersects with the 
y-axis. Namely, this point is the point (0, f(0)) = (0, ao). Each share is a point on the line. Any two 
points determine the line and hence the secret. With just a single point, the line can be any line that 
passes the point, and hence the secret can be any point on the y-axis. 


Blakley’s secret sharing scheme [Bla79] is geometric in nature. The secret is a point in an m- 
dimensional space. n shares ate constructed with each share defining an affine hyperplane in this 
space; an affine hyperplane is the set of solutions x = (21,...,%m) to an equation of the form 
4111 +++: + amm = b. By finding the intersection of any m of these planes, the secret (that is, the 
point of intersection) can be obtained. This scheme is not perfect, as the person with a shate of 
the secret knows the secret is a point on his or her hyperplane. Nevertheless, this scheme can be 
modified to achieve perfect security [SimI2]. 


A special case of Blakley’s scheme is shown in Figure This is based on a scenario where two 
shares are required to recover the secret, which means that a two-dimensional plane is used. The 
secret is a point in the plane. Each share is a line that passes through the point. If any two of the 
shares are put together, the point of intersection, which is the secret, can be easily derived. 


Naor and Shamir [NS94] developed what they called visual secret sharing schemes, which are an 
interesting visual variant of the ordinary secret sharing schemes. 
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Figure 3.2: Blakley’s scheme. 


Roughly speaking, the problem can be formulated as follows: There is a secret picture to be shared 
among n participants. The picture is divided into n transparencies (shares) such that if any m 
transparencies ate placed together, the picture becomes visible, but if fewer than m transparencies 
are placed together, nothing can be seen. Such a scheme is constructed by viewing the secret picture 
as a set of black and white pixels and handling each pixel separately (see [NS94] for more details). 
The schemes are perfectly secure and easily implemented without any cryptographic computation. 
A further improvement allows each transparency (share) to be an innocent picture (for example, a 
picture of a landscape or a picture of a building), thus concealing the fact that secret sharing is taking 
place. 


CHAPTER 4 
Applications of Cryptography 


This chapter is an overview of the most important protocols and systems made possible by cryp- 
tography. In particular, it discusses the issues involved in establishing a cryptographic infrastructure, 
and it gives a brief overview of some of the electronic commerce techniques available today. 


4.1 KEY MANAGEMENT 
4.1.1 Whatis key management? 


Key management deals with the secure generation, distribution, and storage of keys. Secure methods 
of key management are extremely important. Once a key is randomly generated (see QuestionH.1.2.2), 
it must remain secret to avoid unfortunate mishaps (such as impersonation). In practice, most attacks 
on public-key systems will probably be aimed at the key management level, rather than at the 
ctyptographic algorithm itself. 


Users must be able to securely obtain a key pair suited to their efficiency and security needs. There 
must be a way to look up other people’s public keys and to publicize one’s own public key. Users 
must be able to legitimately obtain others’ public keys; otherwise, an intruder can either change 
public keys listed in a directory, or impersonate another user. Certificates are used for this purpose 
(see Question f.1.3.10). Certificates must be unforgeable. The issuance of certificates must proceed 
in a secure way, impervious to attack. In particular, the issuer must authenticate the identity and the 
public key of an individual before issuing a certificate to that individual. 


If someone’s private key is lost or compromised, others must be made aware of this, so they will no 
longer encrypt messages under the invalid public key nor accept messages signed with the invalid 
private key. Users must be able to store their private keys securely, so no intruder can obtain them, 
yet the keys must be readily accessible for legitimate use. Keys need to be valid only until a specified 
expiration date but the expiration date must be chosen properly and publicized in an authenticated 
channel. 
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4.1.2 GENERAL 
4.1.2.1 Whatkey size should be used? 


The key size that should be used in a particular application of cryptography depends on two things. 
First of all, the value of the key is an important consideration. Secondly, the actual key size depends 
on what cryptographic algorithm is being used. 


Due to the rapid development of new technology and cryptanalytic methods, the correct key size 
for a particular application is continuously changing. For this reason, RSA Laboratories refers to 


its web site [http://www.rsasecurity.com/rsalabs/ for updated recommendations. The table 


below contains key size limits and recommendations from different sources for block ciphers, the 
RSA system, the elliptic curve system, and DSA. 


Some comments: 


e Export grade or nominal grade gives little real protection; the key sizes are the limits specified 
in the Wassenaar Arrangement (see Question 6.5.3). 


“Traditional recommendations” are recommendations such as those given in earlier versions of 
this FAQ. Such recommendations are normally based on the traditional approach of counting 
MIPS-years for the best available key breaking algorithms. There are several reasons to call 
this approach in question. For example, an algorithm with massive memory requirements is 
probably not equivalent to an algorithm with low memory requirements. 


e The last rows in the table give lower bounds for commercial applications as suggested by 
Lenstra and Verheul [LYOO]. The first of these rows shows recommended key sizes of today, 
while the second row gives estimated lower bounds for 2010. The bounds are based on the 
assumption that DES was sufficiently secure until 1982 along with several hypotheses, which 
are all extrapolations in the spirit of Moore’s Law (the computational power of a chip doubles 
every 18 months). One questionable assumption they make is that computers and memory will 
be able for free. It seems that this assumption is not realistic for key breaking algorithms with 
large memory requirements. One such algorithm is the General Number Field Sieve used in 


RSA key breaking efforts. 
Block Cipher | RSA Elliptic Curve | DSA 
Export Grade 56 512 112 512 / 112 
Traditional 80 1024 160 1024 / 160 
recommendations 112 2048 224 2048 / 224 
Lenstra/Verheul 2000 | 70 952 132 952 / 125 
2010 || 78 1369 146 / 160 1369 / 138 


Table 2. Minimal key lengths in bits for different grades. 


Notes. The RSA key size refers to the size of the modulus. The Elliptic Curve key size refers to the 
minimum order of the base point on the elliptic curve; this order should be slightly smaller than 
the field size. The DSA key sizes refer to the size of the modulus and the minimum size of a large 


subgroup, respectively (the size of the subgroup is often considerably larger in applications). In the 
last row there are two values for elliptic curve cryptosystems; the choice of key size should depend 
on whether any significant cryptanalytic progress in this field is expected or not. 
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4.1.2.2 How does one find random numbers for keys? 


Whether using a secret-key cryptosystem or a public-key cryptosystem, one needs a good source 
of random numbers for key generation. The main features of a good soutce are that it produces 
numbers that are unknown and unpredictable by potential adversaries. Random numbers obtained 
from a physical process ate in principle the best, since many physical processes appear truly random. 
One could use a hardware device, such as a noisy diode; some are sold commercially on computer 
add-in boards for this purpose. Another idea is to use physical movements of the computer user, 
such as inter-key stroke timings measured in microseconds. Techniques using the spinning of disks to 
generate random data are not truly random, as the movement of the disk platter cannot be considered 
truly random. A negligible-cost alternative is available; Davis et al. designed a random number 
generator based on the variation of a disk drive motor's speed [DIP94]. This variation is caused by air 
turbulence, which has been shown to be unpredictable. By whichever method they are generated, the 
random numbers may still contain some correlation, thus preventing sufficient statistical randomness. 
Therefore, it is best to run them through a good hash function (see Question 2.1.0) before actually 


using them [ECS9A]. 


Another approach is to use a pseudo-random number generator fed by a random seed. The 
primary difference between random and pseudo-random numbers is that pseudo-random numbers 
are necessarily periodic whereas truly random numbers are not. Since pseudo-random number 
generators are deterministic algorithms, it is important to find one that is cryptographically secure 
and also to use a good random seed; the generator effectively acts as an “expander” from the seed 
to a larger amount of pseudo-random data. The seed must be sufficiently variable to deter attacks 
based on trying all possible seeds. 


It is not sufficient for a pseudo-random number generator just to pass a variety of statistical tests, 
as described in Knuth [Knu8T] and elsewhere, because the output of such generators may still be 
predictable. Rather, it must be computationally infeasible for an attacker to determine any bit of the 
output sequence, even if all the others are known, with probability better than 1/2. Blum and Micali’s 
generator based on the discrete logarithm problem [[BM84] satisfies this stronger definition, assuming 
that computing discrete logarithm is difficult (see Question 2.3.7). Other generators perhaps based 
on DES (see Section B.J) or a hash function (see Question 2.1.6) can also be considered to satisfy 
this definition, under reasonable assumptions. 


A summary of methods for generating random numbers in software can be found in [Mar90]. 


Note that one does not need random numbers to determine the public and private exponents in RSA. 
After generating the primes, and hence the modulus (see Question B.T.T), one can simply choose an 
arbitrary value (subject to the standard constraints) for the public exponent, which then determines 
the private exponent. 


4.1.2.3 Whatis the life cycle of a key? 


Keys have limited lifetimes for a number of reasons. The most important reason is protection against 
cryptanalysis (see Section 2.4). Each time the key is used, it generates a number of ciphertexts. Using 
a key repetitively allows an attacker to build up a store of ciphertexts (and possibly plaintexts) which 
may prove sufficient for a successful cryptanalysis of the key value. Thus keys should have a limited 
lifetime. If you suspect that an attacker may have obtained your key, the key should be considered 
compromised, and its use discontinued. 


Research in cryptanalysis can lead to possible attacks against either the key or the algorithm. 
For example, recommended RSA key lengths are increased every few years to ensure that the 
improved factoring algorithms do not compromise the security of messages encrypted with RSA. 
The recommended key length depends on the expected lifetime of the key. Temporary keys, which 
are valid for a day or less, may be as short as 512 bits. Keys used to sign long-term contracts for 
example, should be longer, say, 1024 bits or more. 


Another reason for limiting the lifetime of a key is to minimize the damage from a compromised key. 
It is unlikely a user will discover an attacker has compromised his or her key if the attacker remains 
“passive.” Relatively frequent key changes will limit any potential damage from compromised keys. 
Ford [Eor¥4] describes the life cycle of a key as follows: 


1. Key generation and possibly registration (for a public key). 
2. Key distribution. 

3. Key activation/deactivation. 

4. Key replacement or key update. 

5. Key revocation. 


6. Key termination, involving destruction or possibly archival. 
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4.1.3 PUBLIC-KEY ISSUES 
4.1.3.1 Whatis a PKI? 


A public-key infrastructure (PKI) consists of protocols, services, and standards supporting applica- 
tions of public-key cryptography. The term PKI, which is relatively recent, is defined variously in 
current literature. PKI sometimes refers simply to a trust hierarchy based on public-key certificates 
[1], and in other contexts embraces encryption and digital signature services provided to end-user 
applications as well [OGY]. A middle view is that a PKI includes services and protocols for managing 
public keys, often through the use of Certification Authority (CA) and Registration Authority (RA) 
components, but not necessarily for performing cryptographic operations with the keys. 


Among the services likely to be found in a PKI are the following: 
e Key registration: issuing a new certificate for a public key. 
e Certificate revocation: canceling a previously issued certificate. 
e Key selection: obtaining a party’s public key. 
e Trust evaluation: determining whether a certificate is valid and what operations it authorizes. 


Key recovery has also been suggested as a possible aspect of a PKI. 


There is no single pervasive public-key infrastructure today, though efforts to define a PKI generally 
presume there will eventually be one, or, increasingly, that multiple independent PKIs will evolve 
with varying degrees of coexistence and interoperability. In this sense, the PKI today can be viewed 
akin to local and wide-area networks in the 1980’s, before there was widespread connectivity via the 
Internet. As a result of this view toward a global PKI, certificate formats and trust mechanisms 
are defined in an open and scaleable manner, but with usage profiles corresponding to trust and 
policy requirements of particular customer and application environments. For instance, it is usually 
accepted that there will be multiple “root” or “top-level” certificate authorities in a global PKI, not 
just one “root,” although in a local PKI there may be only one root. Accordingly, protocols are 
defined with provision for specifying which roots ate trusted by a given application or user. 


Efforts to define a PKI today are underway in several governments as well as standards organizations. 
The U.S. Department of the Treasury and NIST both have PKI programs [2,3], as do Canada [4] 
and the United Kingdom [5]. NIST has published an interoperability profile for PKI components 
[BDN97]; it specifies algorithms and certificate formats that certification authorities should support. 
Some standards bodies which have worked on PKI aspects have included the IETF’s PKIX and 
SPKI working groups [6,7] and The Open Group [8]. 

Most PKI definitions are based on X.509 certificates, with the notable exception of the IETF’s SPKI. 


[1] PKI -- PC Webopedia Definitions and Links: 


http://webopedia.internet.com/TERM/P/PKI.html]. 


[2] Government Information Technology Services, Federal Public key Infrastructure: 


gits-sec.treas.gov/ootpkisteer.htn]. 


[3] NIST Public key Infrastructure Program: 


http://csrc.ncsl.nist.gov/pki/|. 


[4] The Government of Canada Public key Infrastructure: 


http://www.cse.dnd.ca/cse/english/gov.htmll. 


[5] The Open Group Public key Infrastructure, Latest Proposals for an HMG PKI. 
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4.1.3.2 Who needs a key pair? 


Anyone who wishes to sign messages or to receive encrypted messages must have a key pair. People 
may have more than one key pair. In fact, it is advisable to use separate key pairs for signing messages 
and receiving encrypted messages. As another example, someone might have a key pair affiliated 
with his or her work and a separate key pair for personal use. Other entities may also have key pairs, 
including electronic entities such as modems, workstations, web servers (web sites) and printers, as 
well as organizational entities such as a corporate department, a hotel registration desk, or a university 
registrar’s office. Key pairs allow people and other entities to authenticate (see Question and 
encrypt messages. 


Corporations may require more than one key pair for communication. They may use one or more 
key pairs for encryption (with the keys stored under key escrow to safeguard the key in event of 
loss) and use a single non-escrowed key pair for authentication. The lengths of the encryption and 
authentication key pairs may be varied according to the desired security. 


4.1.3.3 How does one geta key pair? 


A user can generate his or her own key pair, or, depending on local policy, a security officer may 
generate key pairs for all users. There are tradeoffs between the two approaches. In the former, the 
user needs some way to trust his or her copy of the key generation software, and in the latter, the user 
must trust the security officer and the private key must be transferred securely to the user. Typically, 
each node on a network should be capable of local key generation. Secret-key authentication systems, 
such as Kerberos, often do not allow local key generation, but instead use a central server to generate 
keys. 


Once a key has been generated, the user must register his or her public key with some central 
administration, called a Certifying Authority (CA). The CA returns to the user a certificate attesting 
to the validity of the user’s public key along with other information (see Questions 4.1.3. 1044.1.3.12). 
If a security officer generates the key pair, then the security officer can request the certificate for the 
user. Most users should not obtain more than one certificate for the same key, in order to simplify 
various bookkeeping tasks associated with the key. 
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4.1.3.4 Should a key pair be shared among users? 


Users who share a private key can impersonate one another (that is, sign messages as one another 
and decrypt messages intended for one another), so in general, private keys should not be shared 
among users. However, some parts of a key may be shared, depending on the algorithm (see 


Question B.6.12). 


In RSA, while each person should have a unique modulus and private exponent (that is, a unique 
private key), the public exponent can be common to a group of users without security being 
compromised. Some public exponents in common use today are 3 and 21% + 1; because these 
numbers are small, the public key operations (encryption and signature verification) are fast relative 
to the private key operations (decryption and signing). If one public exponent becomes standard, 
software and hardware can be optimized for that value. However, the modulus should not be shared. 


In public-key systems based on discrete logarithms, such as Diffie-Hellman, DSA, and ElGamal 
(see Question B.6.1) Section B.4) and Question B.6.8), a group of people can share a set of system 
parameters, which can lead to simpler implementations. This is also true for systems based on elliptic 
curve discrete logarithms. It is worth noting, however, that this would make breaking a key more 
attractive to an attacker because it is possible to break every key with a given set of system parameters 
with only slightly more effort than it takes to break a single key. To an attacker, therefore, the average 
cost to break a key is much lower with a set common parameters than if every key had a distinct set 
of parameters. 


4.1.3.5 What happens when a key expires? 


In order to guard against a long-term cryptanalytic attack, every key must have an expiration date 
after which it is no longer valid (see Question f.1.2.3). The time to expiration must therefore be 
much shorter than the expected time for cryptanalysis. That is, the key length must be long enough 
to make the chances of cryptanalysis before key expiration extremely small. The validity period for 
a key pair may also depend on the circumstances in which the key is used. The appropriate key size 
is determined by the validity period, together with the value of the information protected by the 
key and the estimated strength of an expected attacker. In a certificate (see Question A.1.3.T0), the 
expiration date of a key is typically the same as the expiration date of the certificate, though it need 
not be. 


A signature verification program should check for expiration and should not accept a message signed 
with an expired key. This means that when one's own key expires, everything signed with it will 
no longer be considered valid. Of course, there will be cases in which it is important that a signed 
document be considered valid for a much longer period of time. Question [7-1] discusses digital 
timestamping as a way to achieve this. 


After expiration, the old key should be destroyed to preserve the security of old messages (note, 
however, that an expited key may need to be retained for some period in order to decrypt 
messages that are still outstanding but encrypted before the key’s expiration). At this point, the 
user should typically choose a new key, which should be longer than the old key to reflect both the 
performance increase of computer hardware and any recent improvements in factoring algorithms 
(see Question f.1.2.]] for recent key length recommendations). 


However, if a key is sufficiently long and has not been compromised, the user can continue to use the 
same key. In this case, the certifying authority would issue a new certificate for the same key, and all 
new signatures would point to the new certificate instead of the old. However, the fact that computer 
hardware continues to improve makes it prudent to replace expired keys with newer, longer keys 
every few yeats. Key replacement enables one to take advantage of any hardware improvements 
to increase the security of the cryptosystem. Faster hardware has the effect of increasing security, 
perhaps vastly, but only if key lengths are increased regularly (see Question 2.3.5). 
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4.1.3.6 What happens if my key is lost? 


If your private key is lost or destroyed but not compromised, you can no longer sign or decrypt 
messages, but anything previously signed with the lost key is still valid. The CA (see Question f.1.3.12) 
must be notified immediately so that the key can be revoked and placed on a certificate revocation 
list (see Question K.1.3.16) to prevent any illegitimate use if the key is found or recovered by an 
adversary. Loss of a private key can happen, for example, if you lose the smart card used to store 
your key, or if the disk on which the key is stored is damaged. You should also obtain a new key 
right away to minimize the number of messages people send you that are encrypted under your old 
key, since these can no longer be read. 


4.1.3.7 What happens if my private key is compromised? 


If your private key is compromised, that is, if you suspect an attacker may have obtained your private 
key, then you should assume the attacker can read any encrypted messages sent to you under the 
corresponding public key, and forge your signature on documents as long as others continue to 
accept that public key as yours. The seriousness of these consequences underscores the importance 
of protecting your private key with extremely strong mechanisms (see Question K.1.3.8). 


You must immediately notify any certifying authorities for the public keys and have your public key 
placed on a certificate revocation list (see Question 1.3.10); this will inform people that the private 
key has been compromised and the public key has been revoked. Then generate a new key pair 
and obtain a new certificate for the public key. You may wish to use the new private key to re-sign 
documents you had signed with the compromised private key, though documents that had been 
timestamped as well as signed might still be valid (see Question [7.11). You should also change the 
way you store your private key to prevent a compromise of the new key. 
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4.1.3.8 How should | store my private key? 


Private keys must be stored securely, since forgery and loss of privacy could result from compromise 
(see Question 4.1.3.7). The measures taken to protect a private key must be at least equal to the 
required security of the messages encrypted with that key. In general, a private key should never be 
stored anywhere in plaintext form. The simplest storage mechanism is to encrypt a private key under 
a password and store the result on a disk. However, passwords are sometimes very easily guessed; 
when this scheme is followed, a password should be chosen very carefully since the security is tied 
directly to the password. 


Storing the encrypted key on a disk that is not accessible through a computer network, such as a 
floppy disk or a local hard disk, will make some attacks more difficult. It might be best to store 
the key in a computer that is not accessible to other users or on removable media the user can 
remove and take with her when she has finished using a particular computer. Private keys may also 
be stored on portable hardware, such as a smart card. Users with extremely high security needs, 
such as certifying authorities, should use tamper-resistant devices to protect their private keys (see 


Question 4.1.3.13). 


4.1.3.9 How dolfind someone else’s public key? 


Suppose Alice wants to find Bob's public key. There are several possible ways of doing this. She 
could call him up and ask him to send his public key via e-mail. She could request it via e-mail, 
exchange it in person, as well as many other ways. Since the public key is public knowledge, there is 
no need to encrypt it while transferring it, though one should verify the authenticity of a public key. 
A mischievous third party could intercept the transmission, replace Bob’s key with his or her own 
and thereby be able intercept and decrypt messages that are sent from Alice to Bob and encrypted 
using the “fake” public key. For this reason one should personally verify the key (for example, 
this can be done by computing a hash of the key and verifying it with Bob over the phone) or 
rely on certifying authorities (see Question for more information on certifying authorities). 
Certifying authorities may provide directory services; if Bob works for company Z, Alice could look 
in the directory kept by Z’s certifying authority. 


Today, full-fledged directories are emerging, serving as on-line white or yellow pages. Along with 
ITU-T X.509 standards (see Question 6.3.2), most directories contain certificates as well as public 
keys; the presence of certificates lower the directories’ security needs. 
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4.1.3.10 Whatare certificates? 


Certificates are digital documents attesting to the binding of a public key to an individual or other 
entity. They allow verification of the claim that a specific public key does in fact belong to a specific 
individual. Certificates help prevent someone from using a phony key to impersonate someone else. 
In some cases it may be necessary to create a chain of certificates, each one certifying the previous 
one until the parties involved are confident in the identity in question. 


In their simplest form, certificates contain a public key and a name. As commonly used, a certificate 
also contains an expiration date, the name of the certifying authority that issued the certificate, a 
serial number, and perhaps other information. Most importantly, it contains the digital signature of 
the certificate issuer. The most widely accepted format for certificates is defined by the ITU-T X.509 
international standard (see Question b.3.2); thus, certificates can be read or written by any application 
complying with X.509. A detailed discussion of certificate formats can be found in [Ken93]. 


4.1.3.11 How are certificates used? 


Certificates are typically used to generate confidence in the legitimacy of a public key. Certificates are 
essentially digital signatures that protect public keys from forgery, false representation, or alteration. 
The verification of a signature therefore can include checking the validity of the certificate for the 
public key involved. Such verification steps can be performed with greater or lesser rigor depending 
on the context. 


The most secure use of authentication involves associating one or more certificates with every signed 
message. The receiver of the message would verify the certificate using the certifying authority’s 
public key and, now confident of the public key of the sender, verify the message’s signature. There 
may be two or more certificates enclosed with the message, forming a hierarchical certificate chain, 
wherein one certificate testifies to the authenticity of the previous certificate. At the end of a 
certificate hierarchy is a top-level certifying authority, which is trusted without a certificate from any 
other certifying authority. The public key of the top-level certifying authority must be independently 
known, for example, by being widely published. It is interesting to note that there are alternative 
trust models being pursued by a variety of researchers that avoid this hierarchical approach. 


The more familiar the sender is to the receiver of the message, or more precisely, the mote trust 
the receiver places in the claim that the public key really is that of the sender, the less need there 
is to enclose and verify certificates. If Alice sends messages to Bob every day, Alice can enclose a 
certificate chain on the first day that Bob verifies. Bob thereafter stores Alice’s public key and no 
more certificates or certificate certifications are necessary. A sender whose company is known to the 
receiver may need to enclose only one certificate (issued by the company), whereas a sender whose 
company is unknown to the receiver may need to enclose two or more certificates. A good rule of 
thumb is to enclose just enough of a certificate chain so the issuer of the highest level certificate 
in the chain is well known to the receiver. If there are multiple recipients then enough certificates 
should be included to cover what each recipient might need. 
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4.1.3.12 Who issues certificates and how? 


Certificates are issued by a certifying authority (CA), which can be any trusted central administration 
willing to vouch for the identities of those to whom it issues certificates and their association with 
a given key. A company may issue certificates to its employees, or a university to its students, or a 
town to its citizens. In order to prevent forged certificates, the CA’s public key must be trustworthy: 
a CA must either publicize its public key or provide a certificate from a higher-level CA attesting to 


the validity of its public key. The latter solution gives rise to hierarchies of CAs. See Figure H.T] for 
an example. 


Certificate issuance proceeds as follows. Alice generates her own key pair and sends the public key to 
an appropriate CA with some proof of her identification. The CA checks the identification and takes 
any other steps necessary to assure itself the request really did come from Alice and that the public 
key was not modified in transit, and then sends her a certificate attesting to the binding between 
Alice and her public key along with a hierarchy of certificates verifying the CA’s public key. Alice can 
present this certificate chain whenever desired in order to demonstrate the legitimacy of her public 
key. Since the CA must check for proper identification, organizations find it convenient to act as a 


CA for their own members and employees. There are also CAs that issue certificates to unaffiliated 
individuals. 


IPRA IPRA 
ZN 
RSADSI RSADSI Hierarchy 
: RSADSI TIS Medium Certifiers 
Commercial : ; Low A 
Residential Assurance VAN 
Assurance Assurance y 
Motorola Apple LA SF MIT USC Personal Certification 
Authorities 
ZN 
Empl. Empl. $ 
Users Users Users Users Users S 
Affil. Affil. End-Users 


Figure 4.1: Example of a certification hierarchy. 


Different CAs may issue certificates with varying levels of identification requirements. One CA may 
insist on seeing a driver's license, another may want the certificate request form to be notarized, yet 
another may want fingerprints of anyone requesting a certificate. Each CA should publish its own 
identification requirements and standards, so verifiers can attach the appropriate level of confidence 
to the certified name-key bindings. CA’s with lower levels of identification requirements produce 
certificates with lower “assurance.” CA’s can thus be considered to be of high, medium, and low 


assurance. One type of CA is the persona CA. This type of CA creates certificates that bind only 
e-mail addresses and their corresponding public keys. It is designed for users who wish to remain 
anonymous yet want to be able to participate in secure electronic services. 


An example of a certificate-issuing protocol is found in Apple Computer's System 7.5 for the 
Macintosh. System 7.5 users can generate a key pair and then request and receive a certificate for the 


public key; the certificate request must be notarized. 


Certificate-related technologies are available from a number of vendors, including 


e Baltimore ([http:/7/www.baltimore.com/). 
e Entrust Technologies ([http://www.entrust.com/]). 


e RSA Security ([http://waw. rsasecurity.com/). 


e VeriSign ([http://waw. verisign. com/]). 
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4.1.3.13 How do certifying authorities store their private keys? 


It is extremely important that the private keys of certifying authorities (see Question are 
stored securely. The compromise of this information would allow the generation of certificates for 
fraudulent public keys. One way to achieve the desired security is to store the key in a tamper-resistant 
device. The device should preferably destroy its contents if ever opened, and be shielded against 
attacks using electromagnetic radiation. Not even employees of the certifying authority should have 
access to the private key itself, but only the ability to use the private key in the process of issuing 
certificates. 


There are many possible designs for controlling the use of a certifying authority’s private key. BBN’s 
SafeKeyper, for instance, is activated by a set of data keys, which are physical keys capable of storing 
digital information. The data keys use secret sharing technology so that several people must use 
their data keys to activate the SafeKeyper. This prevents a disgruntled CA employee from producing 
phony certificates. 


Note that if the certificate-signing device is destroyed accidentally, then no security is compromised. 
Certificates signed by the device are still valid, as long as the verifier uses the correct public key. 
Moreover, some devices ate manufactured so a lost private key can be restored into a new device. 
(see Question f.1.3.15] for a discussion of lost CA private keys). 


4.1.3.14 How are certifying authorities susceptible to attack? 


One can think of many attacks aimed at certifying authorities (see Question all of which can 
be defended against. For instance, an attacker may attempt to discover the private key of a certifying 
authority by reverse engineering the device in which itis stored. For this reason, a certifying authority 
must take extreme precautions to prevent illegitimate access to its private key; see Question 
for discussion. 


The certifying authority’s key pair might be the target of an extensive cryptanalytic attack. For 
this reason, CAs should use long keys, and should also change keys regularly. Top-level certifying 
authorities need especially long keys, as it may not be practical for them to change keys frequently 
because the public key may be written into software used by a large number of verifiers. 


What if an attacker breaks a CA’s key, but the CA is no longer using it? Though the key has long 
since expired, the attacker, say Alice, can now forge a certificate dated 15 years ago attesting to a 
phony public key of some other person, say Bob. Alice can then forge a document with a signature 
of Bob dated 15 years ago, perhaps a will leaving everything to Alice. The underlying issue raised 
by this attack is how to authenticate a signed document dated many years ago. Timestamps are the 
solution in this case (see Question [7.11). 


There are other attacks to consider that do not involve the compromise of a CA’s private key. For 
instance, suppose Bob wishes to impersonate Alice. If Bob can convincingly sign messages as Alice, 
he can send a message to Alice’s bank saying “I wish to withdraw $10,000 from my account. Please 
send me the money.” To carry out this attack, Bob generates a key pair and sends the public key 
to a certifying authority saying “I’m Alice. Here is my public key. Please send me a certificate.” If 
the CA is fooled and sends him such a certificate, he can then fool the bank, and his attack will 
succeed. In order to prevent such an attack, the CA must verify that a certificate request did indeed 
come from its purported author, that is, it must require sufficient evidence that it is actually Alice 
who is requesting the certificate. The CA may, for example, require Alice to appear in person and 
show a birth certificate. Some CAs may require very little identification, but the bank should not 
honor messages authenticated with such low-assurance certificates. Every CA must publicly state its 
identification requirements and policies so others can then attach an appropriate level of confidence 
to the certificates. 


In another attack, Bob bribes someone who works for the CA to issue to him a certificate in the 
name of Alice. Now Bob can send messages signed in Alice’s name and anyone receiving such a 
message will believe it is authentic because a full and verifiable certificate chain will accompany the 
message. This attack can be hindered by requiring the cooperation of two (or more) employees to 
generate a certificate; the attacker now has to bribe two or more employees rather than one. 


Unfortunately, there may be other ways to generate a forged certificate by bribing only one employee. 
If each certificate request is checked by only one employee, that one employee can be bribed and slip 
a false request into a stack of real certificate requests. Note that a corrupt employee cannot reveal 
the certifying authority’s private key as long as it is properly stored. 
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A CA should also be certain that a user possesses the private key corresponding to the public key 
that is certified; otherwise, certain attacks become possible where the user attaches a certificate to a 


message signed by someone else (see [KaI93b)). (See also [MQV95] for a discussion of this issue in 


the context of key agreement protocols.) 


4.1.3.15 What ifa certifying authority’s key is lost or compromised? 


If the certifying authority’s key is lost or destroyed but not compromised, certificates signed with the 
old key are still valid, as long as the verifier knows to use the old public key to verify the certificate. 
In some designs for certificate-signing devices, encrypted backup copies of the CA’s private key are 
kept, so a CA that loses its key can then restore it by loading the encrypted backup into the device. 
If the device itself is destroyed, the manufacturer may be able to supply another one with the same 
internal information, thus allowing recovery of the key. 


A compromised CA key is a much more dangerous situation. An attacker who discovers a certifying 
authority’s private key can issue phony certificates in the name of the certifying authority, which 
would enable undetectable forgeries. For this reason, all precautions must be taken to prevent 


compromise, including those outlined in Questions K.1.3.13 and Question f.1.3.14, 


If a compromise does occur, the CA must immediately cease issuing certificates under its old key and 
change to a new key. If it is suspected that some phony certificates were issued, all certificates should 
be recalled and then reissued with the new CA key. These measures could be relaxed somewhat if 
the certificates were registered with a digital timestamping service (see Question [7-TT). Note that 
compromise of a CA key does not invalidate users’ keys, but only the certificates that authenticate 
them. Compromise of a top-level CA’s private key should be considered catastrophic, since the 
public key may be built into applications that verify certificates. 
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4.1.3.16 What are Certificate Revocation Lists (CRLs)? 


A certificate revocation list (CRL) is a list of certificates that have been revoked before their scheduled 
expiration date. There are several reasons why a certificate might need to be revoked and placed on 
a CRL. For instance, the key specified in the certificate might have been compromised or the user 
specified in the certificate may no longer have authority to use the key. For example, suppose the 
user name associated with a key is “Alice Avery, Vice President, Argo Corp.” If Alice were fired, her 
company would not want her to be able to sign messages with that key, and therefore the company 
would place the certificate on a CRL. 


When verifying a signature, one examines the relevant CRL to make sure the signer's certificate has 
not been revoked. Whether 1t is worth the time to perform this check depends on the importance 
of the signed document. A CRL is maintained by a CA, and it provides information about revoked 
certificates that were issued by that CA. CRLs only list current certificates, since expired certificates 
should not be accepted in any case: when a revoked certificate’s expiration date occurs, that certificate 
can be removed from the CRL. 


CRLs are usually distributed in one of two ways. In the “pull” model, verifiers download the CRL 
from the CA, as needed. In the “push” model, the CA sends the CRL to the verifiers at regular 
intervals. Some systems use a hybrid approach where the CRL is pushed to several intermediate 
repositories from which the verifiers may retrieve it as needed. 


Although CRLs are maintained in a distributed manner, there may be central repositories for CRLs, 
such as, network sites containing the latest CRLs from many organizations. An institution like a 
bank might want an in-house CRL repository to make CRL searches on every transaction feasible. 
The original CRL proposals often required a list, per issuer, of all revoked certificates; new certificate 
revocation methods (for example, in X.509 version 3; see Question 6.3.2) are more flexible. 


4.2 ELECTRONIC COMMERCE 


This section deals with electronic commerce, payment systems, and transactions over open networks. 
While several protocols and payment systems are described in this section, the most widely used 
protocol for internet transactions is SSL, which is described in the answer to Question 6.1.2, 


4.2.1 Whatis electronic money? 


Electronic money (also called electronic cash or digital cash) is a term that is still fairly vague and 
undefined. It refers to transactions carried out electronically with a net result of funds transferred 
from one party to another. Electronic money may be either debit or credit. Digital cash per se 
is basically another currency, and digital cash transactions can be visualized as a foreign exchange 
market. This is because we need to convert an amount of money to digital cash before we can spend 
it. The conversion process is analogous to purchasing foreign currency. 


Pioneer work on the theoretical foundations of digital cash was carried out by Chaum [Cha83] 
[Cha85]. Digital cash in its precise definition may be anonymous or identified. Anonymous schemes 
do not reveal the identity of the customer and are based on blind signature schemes (see Question[/.3). 
Identified spending schemes always reveal the identity of the customer and are based on mote general 
forms of signature schemes. Anonymous schemes ate the electronic analog of cash, while identified 
schemes are the electronic analog of a debit or credit card. There are other approaches, payments 
can be anonymous with respect to the merchant but not the bank, or anonymous to everyone, but 
traceable (a sequence of purchases can be related, but not linked directly to the spender’s identity). 


Since digital cash is merely an electronic representation of funds, it is possible to easily duplicate 
and spend a certain amount of money more than once. Therefore, digital cash schemes have been 
structured so that it is not possible to spend the same money more than once without getting caught 
immediately or within a short period of time. Another approach is to have the digital cash stored in 
a secure device, which prevents the user from double spending. 


Electronic money also encompasses payment systems that are analogous to traditional credit cards 
and checks. Here, cryptography protects conventional transaction data such as an account number 
and amount; a digital signature can replace a handwritten signature or a credit-card authorization, 
and public-key encryption can provide confidentiality. There are a variety of systems for this type 
of electronic money, ranging from those that are strict analogs of conventional paper transactions 
with a typical value of several dollars or more, to those (not digital cash per se) that offer a form of 
“micropayments” where the transaction value may be a few pennies or less. The main difference 
is that for extremely low-value transactions even the limited overhead of public-key encryption and 
digital signatures is too much, not to mention the cost of “clearing” the transaction with bank. As a 
result, “batching” of transactions is required, with the public key operations done only occasionally. 


Several web pages surveying payment systems and other forms of electronic money are available, 
including the following: 


es.cs.tcd.ie/mepeirce/Project/oninternet.html] by Michael Peirce. 


e [http://www.w3.org/hypertext/WWW/Payments/roadmap.html] by Phillip Hallam-Baker. 
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e [http://nii.isi.edu/info/netcheque/related.html], part of the NetCheque project at 
the Information Sciences Institute (University of Southern California). 
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4.2.2 Whatis iKP? 


The Internet Keyed Payments Protocol (KP) is an architecture for secure payments involving three 
or more parties [BGHY5]. Developed at IBM’s T.J. Watson Research Center and Zurich Research 
Laboratory, the protocol defines transactions of a “credit card” nature, where a buyer and seller 
interact with a third party “acquirer,” such as a credit-card system or a bank, to authorize transactions. 


The protocol is based on public-key cryptography. 
iKP is no longer widely in use, however it is the current foundation for SET (see Question 1.2.3). 
Additional information on ¡KP is available from 


extern/ecommerce/iKP.htmll|. 
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4.2.3 Whatis SET? 


Visa and MasterCard have jointly developed the Secure Electronic Transaction (SET) protocol as a 
method for secure, cost effective bankcard transactions over open networks. SET includes protocols 
for purchasing goods and services electronically, requesting authorization of payment, and requesting 
“credentials” (that is, certificates) binding public keys to identities, among other services. Once SET 
is fully adopted, the necessary confidence in secure electronic transactions will be in place, allowing 
merchants and customers to partake in electronic commerce. 


SET supports DES (see Section for bulk data encryption and RSA (see Question B.T.T) for 
signatures and public-key encryption of data encryption keys and bankcard numbers. The RSA 
public-key encryption employs Optimal Asymmetric Encryption Padding [BRIA]. 


SET is being published as open specifications for the industry, which may be used by software 
vendors to develop applications. 


More information can be found at 


http: //www.visa.co 


and 


http: //www.mastercard.con]. 


4.2.4 Whatis Mondex? 


Mondex is a payment system in which currency is stored in smart cards. These smart cards are 
similar in shape and size to credit cards, and generally permit the storage of sums of money up to 
several hundred dollars. Money may be transferred from card to card arbitrarily many times and 
in any chosen amounts. There is no concern about coin sizes, as with traditional currency. The 
Mondex system also provides a limited amount of anonymity. The system carries with it one of 
the disadvantages of physical currency: if a Mondex card is lost, the money it contains is also lost. 
Transfers of funds from card to card are effected with any one of a range of intermediate hardware 
devices. 


The Mondex system relies for its security on a combination of cryptography and tamper-resistant 
hardware. The protocol for transferring funds from one card to another, for instance, makes 
use of digital signatures (although Mondex has not yet divulged information about the algorithms 
employed). Additionally, the system assumes that users cannot tamper with cards, that is, access and 
alter the balances stored in their cards. The Mondex system is managed by a corporation known as 
Mondex International Ltd., with a number of associated national franchises. Pilots of the system 
have been initiated in numerous cities around the world. 


For mote information on Mondex, visit their web site at 


http: //www.mondex.con]. 
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4.2.5 What are micropayments? 


Micropayments are payments of small sums of money, generally in denominations smaller than those 
in which physical currency is available. It is envisioned that sums of as little as 1/1000th of a cent may 
someday be used to pay for content access or for small quantities of network resources. Conventional 
electronic payment systems require too much computation to handle such sums with acceptable 
efficiency. Micropayment systems enable payments of this size to be achieved in a computationally 
lightweight manner, generally by sacrificing some degree of security. 


One example of a micropayment system, proposed by Rivest and Shamir, is known as MicroMint. 
In MicroMint, a coin consists of a hash collision computed under certain carefully tuned constraints. 
By investing in extensive computational resources, a mint may compute a number of these coins, 
and then sell them in batches. MicroMint exploits the efficiency of hash function calculations: any 
party can quickly verify the legitimacy of coin. While deriving new coins is hard in MicroMint, it 
is possible for users to re-spend the same coin or to set up an expensive forging operation. The 
MicroMint system addresses this shortcoming by presuming that it will not be worthwhile for a user 
to cheat in this manner. 


CHAPTER 3 
Cryptography in the Real W orld 


This chapter goes over some of the most important cryptographic systems in place around the world 
today, including secure Internet communications, and some of the more popular cryptographic 
products. It also gives an overview of the major groups of cryptographic standards. 


5.1 SECURITY ON THE INTERNET 
5.1.1 Whatis S/MIME? 


S/MIME (Secure / Multipurpose Internet Mail Extensions) is a protocol that adds digital signatures 
and encryption to Internet MIME (Multipurpose Internet Mail Extensions) messages described in 
RFC 1521. MIME is the official proposed standard format for extended Internet electronic mail. 
Internet e-mail messages consist of two parts, the header and the body. The header forms a collection 
of field/value pairs structured to provide information essential for the transmission of the message. 
The structure of these headers can be found in RFC 822. The body is normally unstructured unless 
the e-mail is in MIME format. MIME defines how the body of an e-mail message is structured. The 
MIME format permits e-mail to include enhanced text, graphics, audio, and more in a standardized 
manner via MIME-compliant mail systems. However, MIME itself does not provide any security 
services. The purpose of S/MIME is to define such services, following the syntax given in PKCS #7 
(see Question 6.3.3) for digital signatures and encryption. The MIME body section carries a PKCS 
#7 message, which itself is the result of cryptographic processing on other MIME body sections. 
S/MIME standardization has transitioned into IETF, and a set of documents describing S/MIME 
version 3 have been published there. 


S/MIME has been endorsed by a number of leading networking and messaging vendors, including 
ConnectSoft, Frontier, FTP Software, Qualcomm, Microsoft, Lotus, Wollongong, Banyan, NCD, 
SecureWare, VeriSign, Netscape, and Novell. For information on S/MIME, check 

http: //www.rsasecurity.com/standards/smime/fag.html]. 


Information on MIME can be found at 


tip: tp.isi.edu/in-notes/rfc1521.txt]. 
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5.1.2 Whatis SSL? 


The SSL (Secure Sockets Layer) Handshake Protocol [Hic95] was developed by Netscape Com- 
munications Corporation to provide security and privacy over the Internet. The protocol supports 
server and client authentication. The SSL protocol is application independent, allowing protocols 
like HTTP (HyperText Transfer Protocol), FTP (File Transfer Protocol), and Telnet to be layered 
on top of it transparently. Still, SSL is optimized for HTTP; for FTP, IPSec (see Question B.1.4) 
might be preferable. The SSL protocol is able to negotiate encryption keys as well as authenticate 
the server before data is exchanged by the higher-level application. The SSL protocol maintains the 
security and integrity of the transmission channel by using encryption, authentication and message 
authentication codes. 


The SSL Handshake Protocol consists of two phases: server authentication and an optional client 
authentication. In the first phase, the server, in response to a client's request, sends its certificate and 
its cipher preferences. The client then generates a master key, which it encrypts with the server’s 
public key, and transmits the encrypted master key to the server. The server recovers the master 
key and authenticates itself to the client by returning a message authenticated with the master key. 
Subsequent data is encrypted and authenticated with keys derived from this master key. In the 
optional second phase, the server sends a challenge to the client. The client authenticates itself to the 
server by returning the client’s digital signature on the challenge, as well as its public-key certificate. 


A variety of cryptographic algorithms ate supported by SSL. During the “handshaking” process, the 
RSA public-key cryptosystem (see Section B.T) is used. After the exchange of keys, a number of 
ciphers are used. These include RC2 (see Question B.6.2), RC4 (see Question B.6.3), IDEA (see 
Question B.6.7), DES (see Section B.2), and triple-DES (see Question B.2.6). The MD5 message- 
digest algorithm (see Question B.6.6) is also used. The public-key certificates follow the X.509 syntax 


(see Question B.3.3). 
For mote information on SSL 3.0, see 
http://home.netscape.com/eng/ss13/index.htmll. 


TLS (Transport Layer Security) is a protocol that is based on and very similar to SSL 3.0; for more 
information about TLS 1.0, see 


We should also mention WTLS (Wireless TLS), which specifies the security layer protocol in WAP 
(Wireless Application Protocol); WAP is the de facto standard for the delivery and presentation of 
information to wireless devices such as mobile phones and pagers. WTLS is very similar to TLS but 
optimized for low-bandwidth bearer networks. For more information on WAP and WTLS, see 


http://www.waptforum.org/what/technical.htn]. 


5.1.3 Whatis S/WAN? 


The S/WAN (Secure Wide Area Network, pronounced “swan”) was an initiative to promote the 
widespread deployment of Internet-based Virtual Private Networks (VPNs). This was accomplished 
by adopting a standard specification for implementing IPSec, the security architecture for the 
Internet Protocol (see Question 6.1.4), thereby ensuring interoperability among firewall and TCP/IP 
products. The use of IPSec allows companies to mix-and-match the best firewall and TCP/IP stack 
products to build Internet-based VPNs. Currently, users and administrators are often locked in to 
single-vendor solutions network-wide, because vendors have been unable to agree upon the details 
of an IPSec implementation. The S/WAN effort should therefore remove a major obstacle to the 
widespread deployment of secure VPNs. 


S/WAN supported encryption at the IP level, which provides more fundamental and lower-level 
security than higher-level protocols, such as SSL (see Question 6.1.2). It was expected that higher-level 
security specifications, including SSL, would be routinely layered on top of S/WAN implementations, 
and these security specifications would work together. 


While S/WAN is no longer an active initiative, there are other related ongoing projects such as 


Linux FreeS/WAN ( ) and the Virtual Private Network Consortium 
(VPNC; see ). Linux FreeS/Wan is a free implementation of IPSec 


and IKE (Internet Key Exchange) for Linux, while VPNC is an international trade association for 
manufacturers in the VPN market. 
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5.1.4 Whatis IPSec? 


The Internet Engineering Task Force 1ETE)s IP Security Protocol (IPSec) working group is defining 
asetof specifications for cryptographically-based authentication, integrity, and confidentiality services 
at the IP datagram layer. IPSec is intended to be the future standard for secure communications 
on the Internet, but is already the de facto standard. The IPSec group's results comprise a basis 
for interoperably secured host-to-host pipes, encapsulated tunnels, and Virtual Private Networks 
(VPNs), thus providing protection for client protocols residing above the IP layer. 


The protocol formats for [PSec’s Authentication Header (AH) and IP Encapsulating Security Payload 
(ESP) are independent of the cryptographic algorithm, although certain algorithm sets are specified as 
mandatory for support in the interest of interoperability. Similarly, multiple algorithms are supported 
for key management purposes (establishing session keys for traffic protection), within IPSec's IKE 
framework. 


The home page of the working group is located at 


html.charters/ipsec-charter.htmll. 


This site contains links to relevant RFC documents and Internet-Drafts. 


5.1.5 Whatis SSH? 


SSH, or Secure Shell, is a protocol which permits secure remote access over a network from one 
computer to another. SSH negotiates and establishes an encrypted connection between an SSH 
client and an SSH server, authenticating the client and server in any of a variety of ways (some of the 
possibilities for authentication are RSA, SecurID, and passwords). That connection can then be used 
for a variety of purposes, such as creating a secure remote login on the server (effectively replacing 
commands such as telnet, rlogin, and rsh) or setting up a VPN (Virtual Private Network). 


When used for creating secure logins, SSH can be configured to forward X11 connections 
automatically over the encrypted “tunnel” so as to give the remote user secure access to the SSH 
server within a full-featured windowing environment. SSH connections and their X11 forwarding 
can be cascaded to give an authenticated user convenient secure windowed access to a complete 
network of hosts. Other TCP/IP connections can also be tunneled through SSH to the server so 
that the remote user can have secure access to mail, the web, file sharing, FTP, and other services. 


The SSH protocol is currently being standardized in the IETF’s SECSH working group: 
http://www.1ietí.org/html.charters/secsh-charter.html]. 


More information about SSH, including how to obtain commercial implementations, is available 


from 
e SSH Communications Security ([http:7/www.ssh.f1)). 
+ Data Fellows ( [KEP 7m Aara co) 
e Van Dyke Technologies ([http://www.vandyke . con). 
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5.1.6 Whatis Kerberos? 


Kerberos [KNT94] is an authentication service developed by the Project Athena team at MIT, based 
on a 1978 paper by Needham and Schroeder [NS78]. The first general use version was version 
4. Version 5, which addressed certain shortfalls in version 4, was released in 1994. Kerberos uses 
secret-key ciphers (see Question for encryption and authentication. Version 4 could only use 
DES (see Section B.2). Unlike a public-key authentication system, Kerberos does not produce digital 
signatures (see Question 2.2.2). Instead Kerberos was designed to authenticate requests for network 
resources rather than to authenticate authorship of documents. Thus, Kerberos does not provide for 
future third-party verification of documents. 


In a Kerberos system, there is a designated site on each network, called the Kerberos server, which 
performs centralized key management and administrative functions. The server maintains a database 
containing the secret keys of all users, authenticates the identities of users, and distributes session 
keys to users and servers who wish to authenticate one another. Kerberos requires trust in a third 
party (the Kerberos server). If the server is compromised, the integrity of the whole system is lost. 
Public-key cryptography was designed precisely to avoid the necessity to trust third parties with 
secrets (see Question 2.2.1). Kerberos is generally considered adequate within an administrative 
domain; however across domains the more robust functions and properties of public-key systems ate 
often preferred. There has been some developmental work in incorporating public-key cryptography 


into Kerberos [Gan95j. 


For detailed information on Kerberos, read “The Kerberos Network Authentication Service (V5)” 
J. Kohl and C. Neuman, RFC 1510) at 


5.2 DEVELOPMENT SECURITY PRODUCTS 
5.2.1 What are CAPIs? 


A CAPI, or cryptographic application programming interface, is an interface to a library of functions 
software developers can call upon for security and cryptography services. The goal of a CAPI is to 
make it easy for developers to integrate cryptography into applications. Separating the cryptographic 
routines from the software may also allow the export of software without any secutity services 
implemented. The software can later be linked by the user to the local security services. CAPIs 
can be targeted at different levels of abstraction, ranging from cryptographic module interfaces to 
authentication service interfaces. The International Cryptography Experiment (ICE) is an informally 
structured program for testing U.S. government’s export restrictions (see Questions and 
on CAPIs. More information can be obtained about this program by e-mail to . Some 
examples of CAPIs include RSA Laboratories” Cryptoki (PKCS #11; see Question B.3.3), NSA’s 
Fortezza (see Question [6.2.6), Internet GSS-API [Lim93], and GCS-API [UG9G]. NSA has prepared 
a helpful report [NSA95] that surveys some of the current CAPIs. 
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5.2.2 Whatis the GSS-API? 


The Generic Security Service API (GSS-API) is a CAPI for distributed security services. It has 
the capacity to handle session communication securely, including authentication, data integrity, 
and data confidentiality. The GSS-API is designed to insulate its users from the specifics of 
underlying mechanisms. GSS-API implementations have been constructed atop a range of secret-key 
and public-key technologies. The current (Version 2) GSS-API definition is available in Internet 
Proposed Standard RFC 2078 at 


GSS-API is also incorporated as an element of the Open Group Common Environment Specification. 
Related ongoing work items include definitions of a complementary API (GSS-IDUP) oriented to 
store-and-forward messaging, of a negotiation facility for selection of a common mechanism shared 
between peers, and of individual underlying GSS-API mechanisms. For more information on 
GSS-IDUP, see 


5.2.3 What are RSA BSAFE CRYPTO-C and RSA BSAFE CRYPTO-J? 


RSA BSAFE Crypto-C (formerly BSAFE) and RSA BSAFE Crypto-] (formerly JSAFE) are low-level 
ctyptographic toolkits that offer developers the tools to add privacy and authentication features to 
their applications. 


RSA BSAFE CRYPTO-C and RSA BSAFE CRYPTO-J are designed to provide the security tools 
for a wide range of applications, such as digitally signed electronic forms, virus detection, or virtual 
ptivate networks. RSA BSAFE CRYPTO-C can support virtually any global security standard; 
and RSA BSAFE CRYPTO-J is compatible with various industry standards, including S/MIME, 


SSL, S/WAN, IPSec, and SET (see Questions B.T.T] 5.1.3 B.1.4, and #23). RSA BSAFE 
CRYPTO-C and RSA BSAFE CRYPTO-] fully support PKCS (see Question 6.3.3). 


RSA has introduced a whole new family of elliptic curve public-key cryptographic methods to 
RSA BSAFE CRYPTO-C; which now includes elliptic curve cryptographic routines for encryption 
(analogous to the ElGamal encryption system), key agreement (analogous to Diffie-Hellman key 
agreement) and digital signatures (analogous to DSA, Schnorr, etc.). This implementation of elliptic 
curve cryptography includes all variants of ECC: Odd Prime, Even Normal, and Even Polynomial, 
as well as the ability to generate new curve parameters for all three fields. 


RSA BSAFE CRYPTO-] is RSA’s first cryptographic toolkit designed specifically for Java developers. 
RSA BSAFE CRYPTO-] has a full suite of Cryptographic Algorithms including RSA Public-key 
ctyptosystem and Diffie-Hellman Key Negotiation, DES, Triple-DES, RC2, RC4, RC5, MD5and 
SHA-1; RSA BSAFE CRYPTO-] provides developers with a state-of-the-art implementation of 
the most important privacy, authentication, and data integrity routines all in Java. RSA BSAFE 
CRYPTO-J uses the same Java Security API developers are used to. The toolkit also includes source 
code for sample applications and easy-to-use self-test modules. This means proven security and 
shorter time-to-market for new Java project. 


For more information on RSA BSAFE CRYPTO-C, RSA BSAFE CRYPTO-J, and other RSA 


products, see 


http://www.rsasecurity.com/products/]. 
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5.2.4 Whatis SecurPC? 


RSA SecurPC is a software utility that encrypts disks and files on both desktop and laptop personal 
computers. SecurPC extends the Windows™ File Manager or Explorer to include options for 
encrypting and decrypting individually selected files or files within selected folders. Each file is 
encrypted using RC4 (see Question with a randomly generated 128-bit key (40 bits for some 
non-U.S. users.) The random key is encrypted under the user's secret key, which is encrypted under 
a key derived from the user's passphrase. This allows the user's passphrase to be changed without 
decrypting and reencrypting all encrypted files. 


SecurPC provides for optional emergency access to encrypted files, based on a k-of-n threshold 
scheme. The user's secret key may be stored, encrypted with the RSA algorithm, under an emergency 
access public key. The corresponding private key is given, in shares, to any number of trustees. A 
designated number of these trustees must present their shares in order to decrypt the encrypted files. 


SecurPC has been superseded by RSA Security's Keon Desktop, but some information about the 
product may still be found at 


http://www.solwaycomms.co.uk/]. 


5.2.5 Whatis SecurlD? 


SecurID is a two-factor authentication system developed by Security Dynamics (now RSA Security). 
It is generally used to secure either local or remote access to computer networks. Each SecurID user 
has a memorized PIN or password, and a hand-held token with a LCD display. The token displays 
a new pseudo-random value, called the tokencode, at a fixed time interval, usually one minute. The 
user combines the memorized factor with the tokencode, either by simple concatenation or entry on 
an optional keypad on the token, to create the passcode, which is then entered to gain access to the 
protected resource. 


The SecurID token is a battery powered, hand-held device containing a dedicated microcontroller. 
The microcontroller stores, in RAM, the current time, and a 64-bit seed value that is unique to a 
particular token. At the specified interval, the seed value and the time are combined through a 
proprietary algorithm stored in the microcontroller’s ROM, to create the tokencode value. 


An authentication server verifies the passcodes. The server maintains a database which contains the 
seed value for each token and the PIN or password for each user. From this information, and the 
current time, the server generates a set of valid passcodes for the user and checks each one against 
the entered value. For more on SecurID, see 


http: //www.rsasecurity.com/products/securid/. 
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5.2.6 Whatis PGP? 


Pretty Good Privacy (PGP) is a software package originally developed by Philip R. Zimmermann 
that provides cryptographic routines for e-mail and file storage applications. Zimmerman took 
existing cryptosystems and cryptographic protocols and developed a program that can run on 
multiple platforms. It provides message encryption, digital signatures, data compression, and e-mail 


compatibility. 


The default algorithms used for encryption as specified in REC 2440 are, in order of preference, 
ElGamal (see Question and RSA (see Section B.T) for key transport and triple-DES (see 
Question B.2.6), IDEA, and CAST5 (see Question B.6.7) for bulk encryption of messages. Digital 
signatures are achieved by the use of DSA (see Section B.4) or RSA for signing and SHA-1 (see 
Question B.6.5) or MD5 (see Question B.6.6) for computing message digests. The shareware program 
ZIP is used to compress messages for transmission and storage. E-mail compatibility is achieved by 
the use of Radix-64 conversion. 


U.S. versions of PGP have been bound by Federal export laws due to their use of export-controlled 
ctyptosystems, but recent relaxations of the U.S. export restrictions will eliminate several such 
obstacles. 


5.3 CRYPTOGRAPHY STANDARDS 
5.3.1 What are ANSI X9 standards? 


American National Standards Institute (ANSI) is broken down into committees, one being ANSI 
xof] The committee ANSI X9 develops standards for the financial industry, more specifically for 
personal identification number (PIN) management, check processing, electronic transfer of funds, 
etc. Within the committee of X9, there are subcommittees; further broken down are the actual 
documents, such as X9.9 and X9.17. 


ANSI X9.9 [[ANS86al is a United States national wholesale banking standard for authentication 
of financial transactions. ANSI X9.9 addresses two issues: message formatting and the particular 
message authentication algorithm. The algorithm defined by ANSI X9.9 is the so-called DES-MAC 
(see Question 2.1.7) based on DES (see Section B.2) in either CBC or CFB modes (see Question 2.1.4). 
A more detailed standard for retail banking was published as X9.19 [[ANS90]. 


The equivalent international standards are ISO 8730 [[ISOS7]. and ISO 8731 for ANSI X9.9, and 
ISO 9807 for ANSI X9.19. The ISO standards differ slightly in that they do not limit themselves to 
DES to obtain the message authentication code but allow the use of other message authentication 


codes and block ciphers (see Question 6.3.4). 


ANSI X9.17 [ANS95] is the Financial Institution Key Management (Wholesale) standard. It defines 
the protocols to be used by financial institutions, such as banks, to transfer encryption keys. This 
protocol is aimed at the distribution of secret keys using symmetric (secret-key) techniques. Financial 
institutions need to change their bulk encryption keys on a daily or per-session basis due to the 
volume of encryptions performed. This does not permit the costs and other inefficiencies associated 
with manual transfer of keys. The standard therefore defines a three-level hierarchy of keys: 


e The highest level is the master key (KKM), which is always manually distributed. 
e The next level consists of key-encrypting keys (KEKs), which are distributed on-line. 
e The lowest level has data keys (KDs), which are also distributed on-line. 


The data keys are used for bulk encryption and are changed on a per-session or per-day basis. New 
data keys are encrypted with the key-encrypting keys and distributed to the users. The key-encrypting 
keys are changed periodically and encrypted with the master key. The master keys are changed less 
often but are always distributed manually in a very secure manner. 


ANSI X9.17 defines a format for messages to establish new keys and replace old ones called CSM 
(cryptographic service messages). ANSI X9.17 also defines two-key triple-DES encryption (see 
Question B.2.6) as a method by which keys can be distributed. ANSI X9.17 is gradually being 
supplemented by public-key techniques such as Diffie-Hellman encryption (see Question B.6.1). 


One of the major limitations of ANSI X9.17 is the inefficiency of communicating in a large system 
since each pair of terminal systems that need to communicate with each other will need to have a 


Í Strictly speaking, the name of the group is Accredited Standards Committee X9; the group is accredited by 
ANSI but operated by the American Bankers Association. 
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common master key. To resolve this problem, ANSI X9.28 was developed to support the distribution 
of keys between terminal systems that do not share a common key center. The protocol defines a 
multiple-center group as two or more key centers that implement this standard. Any member of the 
multiple-center group is able to exchange keys with any other member. 


ANSI X9.30 [ANS97] is the United States financial industry standard for digital signatures based on 
the federal Digital Signature Algorithm (DSA), and ANSI X9.31 [ANS98] is the counterpart standard 
for digital signatures based on the RSA algorithm. ANSI X9.30 requires the SHA-1 hash algorithm 
encryption (see Question B.6.5); ANSI X9.31 requires the MDC-2 hash algorithm [[SO92q]. A related 
document, X9.57, covers certificate management encryption. 


ANSI X9.42 [ANS94al is a draft standard for key agreement based on the Diffie-Hellman algorithm, 
and ANSI X9.44 [ANS94b] is a draft standard for key transport based on the RSA algorithm. 
The former is intended to specify techniques for deriving a shared secret key; techniques currently 
being considered include basic Diffie-Hellman encryption (see Question 8.6.1), authenticated Diffie- 
Hellman encryption, and the MQV protocols [MQV95]. Some work to unify the various approaches 
is currently in progress. ANSI X9.44 will specify techniques for transporting a secret key with the 
RSA algorithm. It is currently based on IBM’s Optimal Asymmetric Encryption Padding, a “provably 
secure” padding technique related to work by Bellare and Rogaway [BR94]. 


ANSI X9.42 was previously part of ANSI X9.30, and ANSI X9.44 was previously part of ANSI 
X9.31. 


5.3.2 Whatare the ITU-T (CCITT) Standards? 


The International Telecommunications Union, ITU-T (formerly known as CCITT), is a multinational 
union that provides standards for telecommunication equipment and systems. ITU-T is responsible 
for standardization of elements such as the X.500 directory [CCIS8b], X.509 certificates and 
Distinguished Names. Distinguished names are the standard form of naming. Á distinguished name 
is comprised of one or mote relative distinguished names, and each relative distinguished name is 
comprised of one or more attribute-value assertions. Each attribute-value assertion consists of an 
attribute identifier and its corresponding value information, for example, “CountryName = US.” 


Distinguished names were intended to identify entities in the X.500 directory tree. A relative 
distinguished name is the path from one node to a subordinate node. The entire distinguished name 
traverses a path from the root of the tree to an end node that represents a particular entity. A goal 
of the directory was to provide an infrastructure to uniquely name every communications entity 
everywhere (hence the “distinguished” in “distinguished name”). As a result of the directory’s goals, 
names in X.509 certificates are perhaps more complex than one might like (for example, compared 
to an e-mail address). Nevertheless, for business applications, distinguished names are worth the 
complexity, as they are closely coupled with legal name registration procedures; this is something 
simple names, such as e-mail addresses, do not offer. 


X.400 

ITU-T Recommendation X.400 [CCIS8a], also known as the Message Handling System (MHS), is 
one of the two standard e-mail architectures used for providing e-mail services and interconnecting 
proprietary e-mail systems. The other is the Simple Mail Transfer Protocol (SMTP) used by the 
Internet. MHS allows e-mail and other store-and-forward message transferring such as Electronic 
business Data Interchange (EDI) and voice messaging. The MHS and Internet mail protocols are 
different but based on similar underlying architectural models. The noteworthy fact of MHS is that it 
has supported secure messaging since 1988 (though it has not been widely deployed in practice). The 
MHS message structure is similar to the MIME (see Question 6.1.1) message structure; it has both 
a header and a body. The body can be broken up into multiple parts, with each part being encoded 
differently. For example, one part of the body may be text, the next part a picture, and a third part 
encrypted information. 


X.435 

ITU-T Recommendation X.435 [CCI91] and its equivalent F.435 are X.400-based and designed to 
support EDI messaging. EDI needs more stringent security than typical e-mail because of its business 
nature: not only does an EDI message need protection against fraudulent or accidental modification 
in transit, but it also needs to be immune to repudiation after it has been sent and received. 


In support of these security requirements, X.435 defines, in addition to normal EDI messages, a set 
of EDI “notifications.” Positive notification implies the recipient has received the document and 
accepts the responsibility for it, while negative notification means the recipient refused to accept 
the document due to a specified reason. For- warding notification means the document had been 
forwarded to another recipient. Together, these notifications form the basis for a system that can 
provide security controls comparable to those in the paper-based system that EDI replaces. 


167 


168 


Frequently Asked Questions About Today's Cryptography / Chapter 5 


X.509 

ITU-T Recommendation X.509 [CCI88d] specifies the authentication service for X.500 directories, 
as well as the widely adopted X.509 certificate syntax. The initial version of X.509 was published in 
1988, version 2 was published in 1993, and version 3 was proposed in 1994 and published in 1995. 
Version 3 addresses some of the security concerns and limited flexibility that were issues in versions 
1 and 2. Directory authentication in X.509 can be carried out using either secret-key techniques or 
public-key techniques. The latter is based on public-key certificates. The standard does not specify a 
particular cryptographic algorithm, although an informative annex of the standard describes the RSA 
algorithm (see Section P.T). 


An X.509 certificate consists of the following fields: 
e version 


serial number 


signature algorithm ID 


e issuer name 


validity period 


subject (user) name 


subject public key information 


issuer unique identifier (version 2 and 3 only) 


subject unique identifier (version 2 and 3 only) 
e extensions (version 3 only) 
e signature on the above fields 


This certificate is signed by the issuer to authenticate the binding between the subject (user’s) name 
and the subject’s public key. The major difference between versions 2 and 3 is the addition of the 
extensions field. This field grants more flexibility as it can convey additional information beyond just 
the key and name binding. Standard extensions include subject and issuer attributes, certification 
policy information, and key usage restrictions, among others. 


X.509 also defines a syntax for certificate revocation lists (CRLs) (see Question f.T.3.16). The X.509 
standard is supported by a number of protocols, including PKCS (see Question and SSL (see 
Question b.1.2). 


5.3.3 Whatis PKCS? 


The Public-Key Cryptography Standards (PKCS) are a set of standards for public-key cryptography, 
developed by RSA Laboratories in cooperation with an informal consortium, originally including 
Apple, Microsoft, DEC, Lotus, Sun and MIT. The PKCS have been cited by the OIW (OSI 
Implementers’ Workshop) as a method for implementation of OSI standards. The PKCS are 
designed for binary and ASCII data; PKCS are also compatible with the ITU-T X.509 standard (see 
Question B.3.2). The published standards are PKCS #1, #3, #5, #7, #8, #9, #10 #11, #12, and 
#15; PKCS #13 and #14 are currently being developed. 


PKCS includes both algorithm-specific and algorithm-independent implementation standards. Many 
algorithms are supported, including RSA (see Section B.T) and Diffie-Hellman key exchange (see 
Question B.6.1), however, only the latter two are specifically detailed. PKCS also defines an 
algorithm-independent syntax for digital signatures, digital envelopes, and extended certificates; this 
enables someone implementing any cryptographic algorithm whatsoever to conform to a standard 
syntax, and thus achieve interoperability. 


The following are the Public-Key Cryptography Standards (PKCS): 


e PKCS #1 defines mechanisms for encrypting and signing data using the RSA public-key 
cryptosystem. 


PKCS #3 defines a Diffie-Hellman key agreement protocol. 


PKCS #5 describes a method for encrypting a string with a secret key derived from a password. 


PKCS #6 is being phased out in favor of version 3 of X.509. 


PKCS #7 defines a general syntax for messages that include cryptographic enhancements such 
as digital signatures and encryption. 


PKCS #8 describes a format for private key information. This information includes a private 
key for some public-key algorithm, and optionally a set of attributes. 


PKCS #9 defines selected attribute types for use in the other PKCS standards. 


PKCS #10 describes syntax for certification requests. 


PKCS #11 defines a technology-independent programming interface, called Cryptoki, for 
cryptographic devices such as smart cards and PCMCIA cards. 


PKCS #12 specifies a portable format for storing or transporting a user's private keys, 
certificates, miscellaneous secrets, etc. 


PKCS #13 is intended to define mechanisms for encrypting and signing data using Elliptic 
Curve Cryptography. 


PKCS #14 is currently in development and covers pseudo-random number generation. 


PKCS #15 is a complement to PKCS #11 giving a standard for the format of cryptographic 
credentials stored on cryptographic tokens. 
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It is RSA Laboratories’ intention to revise the PKCS documents from time to time to keep track 
of new developments in cryptography and data security, as well as to transition the documents into 
open standards development efforts as opportunities arise. Documents detailing the PKCS standards 
can be obtained at RSA Security’s web server, which is accessible from 

http: //www.rsasecurity.com/rsalabs/pkcs 


or via anonymous ftp to 


tps tp.rsasecurity.com/pub/pkcs/doc/|. 


Questions and comments can be directed to [pkcs-editor@rsasecurity. con]. 


5.3.4 Whatare ISO standards? 


The International Organization for Standardization, (ISO), is a non-governmental body promoting 
standardization developments globally. Altogether, ISO is broken down into about 2700 Tech- 
nical Committees, subcommittees and working groups. ISO/IEC (International Electrotechnical 
Commission) is the joint technical committee developing the standards for information technology. 


One of the more important information technology standards developed by ISO/IEC is ISO/IEC 
9798 [ISOYZa]. This is an emerging international standard for entity authentication techniques. 
It consists of five parts. Part 1 is introductory, and Parts 2 and 3 define protocols for entity 
authentication using secret-key techniques and public-key techniques. Part 4 defines protocols based 
on cryptographic checksums, and part 5 addresses zero-knowledge techniques. 


ISO/IEC 9796 is another ISO standard that defines procedures for digital signature schemes giving 
message recovery (such as RSA and Rabin-Williams). ISO/IEC International Standard 9594-8 is 
also published (and is better known) as ITU-T Recommendation X.509, “Information Technology 
-- Open Systems Interconnection -- The Directory: Authentication Framework,” and is the basic 
document defining the most widely used form of public-key certificate. 


Another example of an ISO/IEC standard is the ISO/IEC 9979 [ISO9T] standard defining the 
procedures for a service that registers cryptographic algorithms. Registering a cryptographic algorithm 
results in a unique identifier being assigned to it. The registration is achieved via a single organization 
called the registration authority. The registration authority does not evaluate or make any judgment 
on the quality of the protection provided. 


For mote information on ISO, contact their official web site 
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5.3.5 Whatis IEEE P1363? 


The IEEE P1363 is an emerging standard that aims to provide a comprehensive coverage of 
established public-key techniques. It continues to move toward completion, with the first ballot 
passed in 1999. The project, begun in 1993, has produced a draft standard covering public-key 
techniques from the discrete logarithm, elliptic curve, and integer factorization families. Contributions 
are currently solicited for an addendum, IEEE P1363a, which will cover additional public-key 


techniques. 
The project is closely coordinated with emerging ANSI standards for public-key cryptography in 


banking, and recent revisions of RSA Laboratories’ PKCS documents (for example, PKCS #1 
Version 2.0) are aligned with IEEE P1363. 


For more information, see 


5.3.6 Whatis the IETF Security Area? 


The Internet Engineering Task Force (IETF) has evolved to become the primary international forum 
for standardization of protocols used in IP networking environments. IETF activities are divided into 
several functional areas; within the Security Area, several working groups have been active in defining 
security protocols and infrastructure facilities. Extensive information on IETF work is available 
at , including working group charters, working documents (Internet- 
Drafts), and published specifications (RFCs). RFCs are issued as standards-track, Informational, 
and Experimental documents; the standards-track documents advance through three maturity levels 
(Proposed Standard, Draft Standard, and Full Standard). 


Some current and recently active IETF Security Area working groups include: 


PKIX Public-Key Infrastructure (X.509), profiling usage of X.509 certificates and CRLs and defining 
associated PKI protocols (e.g., certificate management, certificate validation) (see Question 4.1.3.1). 


IPSec IP Security Protocol, defining encapsulation and key establishment protocols for use in 
protecting messages at the IP layer (see Question 5.1.4). 


S/MIME defining the S/MIME Version 3 and related protocols for use in protecting electronic 
mail and other application messaging traffic (see Question B.T.T). 


TLS Transport Layer Security, defining the standardized successor to the widely-deployed Secure 
Sockets Layer (SSL) protocol (see Question 6.1.2). 


CAT Common Authentication Technology, defining mechanisms and interfaces (GSS-API) for 
callable integration of security services into applications (see Question 6.2.2). 


XMLDSIG XML Digital Signatures, chartered in conjunction with the World-Wide Web Consor- 
tium to define digital signature facilities for XML documents. 


SPKI Simple Public-Key Infrastructure, which has issued Experimental documents concerning 
definition and usage of certificates in a non-X.509 format. 


OPENPGP An Open Specification for Pretty Good Privacy, defining a specification for message 
and key formats as used in PGP (see Question 6.2.0). 


SSH Secure Shell, defining specifications for the Secure Shell protocol (see Question 5.1.5). 
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CHAPTER 6 
Laws Concerning Cryptography 


This chapter deals with the legal and political issues associated with cryptography, including 
government involvement, patent issues, and import and export regulations. Note that while this 
chapter includes legal information, it should not be used as a substitute for consulting an attorney 
(see below). 


6.1 LEGAL DISCLAIMER 


The materials should not be treated or relied upon as advice on technical and non-technical issues 
and the materials have not been updated to reflect recent changes in technology, the law, or any 
other areas. Furthermore, RSA cannot warrant that the information herein is complete or accurate 
and does not assume, and hereby disclaims, any liability to any person for any loss or damage caused 
by errors or omissions in the FAQ resulting from negligence, accident or any other cause. 


6.2 GOVERNMENT INVOLVEMENT 
6.2.1 Whatis NIST? 


NIST is an acronym for the National Institute of Standards and Technology, a division of the U.S. 
Department of Commerce. NIST was formerly known as the National Bureau of Standards (NBS). 
Through its Computer Systems Laboratory it aims to promote open systems and interoperability 
that will spur the development of computer-based economic activity. NIST issues standards and 
guidelines intended to be adopted in all computer systems in the U.S., and also sponsors workshops 
and seminars. Official standards are published as FIPS (Federal Information Processing Standards) 
publications. 


In 1987 Congress passed the Computer Security Act, which authorized NIST to develop standards 
for ensuring the security of sensitive but unclassified information in government computer systems. 
It encouraged NIST to work with other government agencies and private industry in evaluating 
proposed computer security standards. 


NIST issues standards for cryptographic algorithms that U.S. government agencies are required to 
use. A large percentage of the private sector often adopts them as well. In January 1977, NIST 
declared DES (see Section B.2) the official U.S. encryption standard and published it as FIPS 46; DES 
soon became a de facto standard throughout the United States. NIST is currently taking nominations 
for the Advanced Encryption Standard (AES), which is to replace DES (see Section B.3). There is no 
definite deadline for the completion of the AES (see Question B.3.3). 


Several years ago, NIST was asked to choose a set of cryptographic standards for the U.S., this has 
become known as the Capstone project (see Question 6.2.3). After a few years of rather secretive 
deliberations, NIST, in cooperation with the NSA (see Question (6.2.2), issued proposals for various 
standards in cryptography. The combination of these proposals, including digital signatures (DSS, 
see Question B.4.1) and data encryption (the Clipper chip, see Question 6.2.4), formed the Capstone 


project. 


NIST has been criticized for allowing the NSA too much power in setting cryptographic standards, 
since the interests of the NSA sometimes conflict with that of the Commerce Department and NIST. 
Yet, the NSA has much more experience with cryptography, and many mote qualified cryptographers 
and cryptanalysts than does NIST so it is perhaps unrealistic to expect NIST to forego such readily 
available assistance. 


For mote information on NIST, visit their web site at 


http://www.nist.gov/]. 


[75 


176 


Frequently Asked Questions About Today's Cryptography / Chapter 6 


6.2.2 Whatis the NSA? 


NSA is the National Security Agency, a highly secretive agency of the U.S. government created by 
Harry S. Truman in 1952. The NSA’s very existence was kept secret for many years. For a history 
of the NSA, see Bamford [Bam82]. The NSA has a mandate to listen to and decode all foreign 
communications of interest to the security of the United States. It has also used its power in various 
ways to slow the spread of publicly available cryptography in order to prevent national enemies from 
employing encryption methods that are presumably too strong for the NSA to break. 


As the premier cryptographic government agency, the NSA has huge financial and computer 
resources and employs a host of cryptographers. Developments in cryptography achieved at the 
NSA are not made public; this secrecy has led to many rumors about the NSA’s ability to break 
popular cryptosystems like DES (see Section B.2), as well as rumors that the NSA has secretly placed 
weaknesses, called “trapdoors,” in government-endorsed cryptosystems. These rumors have never 
been proved or disproved. Also the criteria used by the NSA in selecting cryptography standards 
have never been made public. 


Recent advances in the computer and telecommunications industries have placed NSA actions under 
unprecedented scrutiny, and the agency has become the target of heavy criticism for hindering U.S. 
industries that wish to use or sell strong cryptographic tools. The two main reasons for this increased 
criticism are the collapse of the Soviet Union and the development and spread of commercially 
available public-key cryptographic tools. Under pressure, the NSA may be forced to change its 
policies. 


The NSA’s charter limits its activities to foreign intelligence. However, the NSA is concerned with the 
development of commercial cryptography, since the availability of strong encryption tools through 
commercial channels could impede the NSA’s mission of decoding international communications. 
In other words, the NSA is worried that strong commercial cryptography may fall into the wrong 
hands. 


The NSA has stated that it has no objection to the use of secure cryptography by U.S. industry. It also 
has no objection to cryptographic tools used for authentication, as opposed to privacy. However, 
the NSA is widely viewed to be following policies that have the practical effect of limiting and/or 
weakening the cryptographic tools used by law-abiding U.S. citizens and corporations; see Barlow 
[Bar92] for a discussion of NSA’s effect on commercial cryptography. 


The NSA exerts influence over commercial cryptography in several ways. NSA serves as an advisor 
to the Bureau of Export Administration (BXA) at the Commerce Department, which is the front-line 
agency on export determination. In the past, BXA generally has not approved export of products 
used for encryption unless the key size is strictly limited. It did, however, approve export of any 
products used for authentication purposes only, no matter how large the key size, as long as the 
product cannot be easily converted to be used for encryption. Today the situation is different 
with dramatically relaxed restrictions on export regulations. The NSA has also blocked encryption 
methods from being published or patented, citing a national security threat; see [Lan88] for a 
discussion of this practice. 


Additionally, the NSA serves an “advisory” role to NIST in the evaluation and selection of official 
U.S. government computer security standards. In this capacity, it has played a prominent and 
controversial role in the selection of DES and in the development of the group of standards known 
as the Capstone project. The NSA can also exert market pressure on U.S. companies to produce (or 
refrain from producing) cryptographic goods, since the NSA itself is often a large customer of these 
companies. Examples of NSA-supported goods include Fortezza (see Question (6.2.0), the Defense 
Messaging System (DMS), and MISSI, the Multilevel Information System Security Initiative. 


Cryptography is in the public eye as never before and has become the subject of national public 
debate. The status of cryptography, and the NSA’s role in it, will probably continue to change over 
the next few years. 
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6.2.3 Whatis Capstone? 


Capstone has been the U.S. government’s long-term project to develop a set of standards for 
publicly available cryptography, as authorized by the Computer Security Act of 1987. The primaty 
agencies responsible for Capstone were NIST and the NSA (see Question (6.2.2). The plan called 
for the elements of Capstone to become official U.S. government standards, in which case both the 
government itself and all private companies doing business with the government would have been 
required to use Capstone. However, Capstone is no longer an active development initiative. 


There are four major components of Capstone: a bulk data encryption algorithm, a digital signature 
algorithm, a key exchange protocol, and a hash function. The data encryption algorithm is called 
Skipjack, often referred to as Clipper (see Question 6.2.4), which was the encryption chip that 
included the Skipjack algorithm. The digital signature algorithm is DSA (see Section B.4) and the 
hash function used is SHA-1 (see Question B.6.5). The key exchange protocol is not published, but 
is generally considered to be related to Diffie-Hellman (see Question B.6.1). 


The Skipjack algorithm and the concept of a Law Enforcement Access Field (LEAFs, see Ques- 
tion [7.13) have been accepted as FIPS 185; DSS has been published as FIPS 186, and finally SHS 
has been published as FIPS 180. 


All parts of Capstone were aimed at the 80-bit security level. The symmetric-keys involved were 80 


bits long and other aspects of the algorithm suite were designed to withstand an “80-bit” attack, that 


980 


is, an effort equivalent to 2°" operations. 


6.2.4 Whatis Clipper? 


Clipper chip techN ology was proposed by the U.S. Government during the mid-1990s, but is no 
longer being actively promoted for general use. The Clipper chip contains an encryption algorithm 
called Skipjack (see Question 6.2.3). Each chip contains a unique 80-bit unit key U, which is escrowed 
in two parts at two escrow agencies; both parts must be known in order to recover the key. Also 
present is a serial number and an 80-bit “family key” E; the latter is common to all Clipper chips. The 
chip is manufactured so that it cannot be reverse engineered; this means that the Skipjack algorithm 
and the keys cannot be recovered from the chip. 


As specified by the Escrowed Encryption Standard, when two devices wish to communicate, they 
first agree on an 80-bit “session key” K. The method by which they choose this key is left up to the 
implementer's discretion; a public-key method such as RSA or Diffie-Hellman seems a likely choice. 
The message is encrypted with the key K and sent (note that the key K is not escrowed.) In addition 
to the encrypted message, another piece of data, called the law-enforcement access field (LEAF, 
see Question [7.13), is created and sent. It includes the session key K encrypted with the unit key 
U, then concatenated with the serial number of the sender and an authentication string, and then, 
finally, all encrypted with the family key. The exact details of the law-enforcement access field are 
classified. The receiver decrypts the law-enforcement access field, checks the authentication string, 
and decrypts the message with the key K. 


Now suppose a law-enforcement agency wishes to “tap the line.” It uses the family key to decrypt 
the law-enforcement access field; the agency now knows the serial number and has an encrypted 
version of the session key. It presents an authorization warrant to the two escrow agencies along with 
the serial number. The escrow agencies give the two parts of the unit key to the law-enforcement 
agency, which then decrypts to obtain the session key K. Now the agency can use K to decrypt the 
actual message. Further details on the Clipper chip operation, such as the generation of the unit key, 
are sketched by Denning [Den93}. 


Matt Blaze, AT&T, showed that it is possible to modify the LEAF in a way such that law enforcement 
cannot determine where the message originally came from [Bla94). 


The Clipper chip proposal aroused much controversy and was the subject of much criticism. 
Unfortunately, two distinct issues became confused in the large volume of public comment and 
discussion. 


First there was controversy about the whole idea of escrowed keys. It is essential for the escrow 
agencies to keep the key databases extremely secure, since unauthorized access to both escrow 
databases could allow unauthorized eavesdropping on private communications. In fact, the escrow 
agencies were likely to be one of the major targets for anyone trying to compromise the Clipper 
system. The Clipper chip factory was another likely target. Those in favor of escrowed keys saw it 
as a way to provide secure communications for the public at large while allowing law-enforcement 
agencies to monitor the communications of suspected criminals. Those opposed to escrowed keys 
saw it as an unnecessary and ineffective intrusion of the government into the private lives of citizens. 
They argued that escrowed keys infringe their rights of privacy and free speech. It will take a lot of 
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time and much public discussion for society to reach a consensus on what role, if any, escrowed keys 
should have. 


The second area of controversy concerned various objections to the specific Clipper proposal, 
that is, objections to this particular implementation of escrowed keys, as opposed to the idea of 
escrowed keys in general. Common objections included: the key escrow agencies will be vulnerable 
to attack; there are not enough key escrow agencies (the current escrow agents are NIST and the 
automated systems division of the department of treasury [IDB95]); the keys on the Clipper chips 
are not generated in a sufficiently secure fashion; there will not be sufficient competition among 
implementers, resulting in expensive and slow chips; softwate implementations are not possible; and 
the key size is fixed and cannot be increased if necessaty. 


Micali [Mic¥3] has proposed an alternative system that also attempts to balance the privacy concerns 
of law-abiding citizens with the investigative concerns of law-enforcement agencies. He called his 
system fair public-key cryptography. It is similar in function and purpose to the Clipper chip proposal 
but users can choose their own keys, which they register with the escrow agencies. Also, the system 
does not require secure hardware, and can be implemented completely in software. Desmedt [Des95] 
has also developed a secure software-based key escrow system that could be a viable alternative. 
There have been numerous other proposals in the cryptographic community over the last few years; 
Denning and Branstad give a nice survey [DB95]. 


6.2.5 Whatis the Current Status of Clipper? 


Clipper has been accepted as FIPS 185 [NIS9Zal] by the federal government. Various forms of 
the Clipper chip were produced; however, it is no longer in production. The chip is still used in 
the AT&T TSD 3600 and in various Fortezza products (see Question [6.2.6), including PC Cards, 
encrypting modems, and PCI board Fortezza. All Capstone-based products have suppressed LEAF 
(see Question [7.13) escrow access function. There is now a CA (Certifying Authority) performing 


key recovety. 
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6.2.6 What is Fortezza? 


The Fortezza Crypto Card, formerly called Tessera, is a PC card (formerly PCMCIA, Personal 
Computer Memory Card International Association) developed by NSA that implements the Capstone 
algorithms. The card provides security through verification, authentication, non-repudiation, and 
encryption. 


Fortezza is intended for use with the Defense Messaging Service (DMS) and is export controlled. 
A number of vendors have announced support for the Fortezza card; NSA has also built and 
demonstrated a PKCS #11-based library (see Question B.3.3) that interfaces to the card. 


Currently, the NSA is working with companies, such as VLSL to develop commercial products 
that implement Fortezza algorithms. VLSI is devising a “Regent” chip that adds DES and RSA 
algorithms. The NSA also supports commercial development of smart card chips with Fortezza 
algorithm capability. 


6.3 PATENTS ON CRYPTOGRAPHY 
6.3.1 Is RSA patented? 


The patent for the RSA algorithm (U.S. Patent 4,405,829) was issued on September 20, 1983, 
exclusively licensed to RSA Security Inc. by the Massachusetts Institute of Technology, with an 
expiration date of September 20, 2000. RSA Security maintained a standard, royalty-based licensing 
policy that could be modified for special circumstances. In the U.S., a license has been needed to 
“make, use or sell” products that included the RSA algorithm. However, RSA Security has long 
allowed free non-commercial use of the RSA algorithm, with written permission, for academic ot 


university research purposes. 


On September 6, 2000, RSA Security made the RSA algorithm publicly available and waived its rights 
to enforce the RSA patent for any development activities that include the algorithm occurring after 
September 6, 2000. From this date forward, companies are able to develop products that incorporate 
their own implementation of the RSA algorithm and sell these products in the U.S. 


For more information on the RSA patent, see 


http: //www.rsasecurity.com/developers/total-solution/fag.html 


(Question updated 9/13/2000) 
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6.3.2 Is DSA patented? 


David Kravitz, former member of the NSA, holds a patent on DSA [KraY3]. Claus P. Schnorr 
has asserted that his patent [Sch9T] covers certain implementations of DSA. RSA Security has also 
asserted coverage of certain implementations of DSA by the Schnorr patent. 


6.3.3 Is DES patented? 


USS. Patent 3,962,539, which describes the Data Encryption Standard (DES), was assigned to IBM 
Corporation in 1976. IBM subsequently placed the patent in the public domain, offering royalty-free 
licenses conditional on adherence to the specifications of the standard. The patent expired in 1993. 
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6.3.4 Are elliptic curve cryptosystems patented? 


Elliptic curve cryptosystems, as introduced in 1985 by Neal Koblitz and Victor Miller, have no 
general patents, though some newer elliptic curve algorithms and certain efficient implementation 
techniques may be covered by patents. 

Here ate some relevant implementation patents. 


e Apple Computer holds a patent on efficient implementation of odd-characteristic elliptic 
curves, including elliptic curves over GF (p) where p is close to a power of 2. 


e Certicom holds a patent on efficient finite field multiplication in normal basis representation, 


which applies to elliptic curves with such a representation 
e Cylink also holds a patent on multiplication in normal basis 


Certicom also has two additional patents pending. The first of these covers the MQV (Menezes, Qu, 
and Vanstone) key agreement technique. Although this technique may be implemented as a discrete 
log system, a number of standards bodies are considering adoption of elliptic-curve-based variants. 
The second patent filing treats techniques for compressing elliptic curve point representations to 
achieve efficient storage in memory. 


In all of these cases, it is the implementation technique that is patented, not the prime or 
representation, and there are alternative, compatible implementation techniques that are not covered 
by the patents. One example of such an alternative is a polynomial basis implementation with 
conversion to normal basis representation where needed. (This should not be taken as a guarantee 
that there are no other patents, of course, as this is not a legal opinion.) The issue of patents and 
representations is a motivation for supporting both representations in the IEEE P1363 and ANSI 
X9.62 standards efforts. 


The patent issue for elliptic curve cryptosystems is the opposite of that for RSA and Diffie-Hellman, 
where the cryptosystems themselves have patents, but efficient implementation techniques often do 
not. 


6.3.5 What are the important patents in cryptography? 


Here is a selection of some of the important and well established patents in cryptography, including 
several expired patents of historical interest. The expiration date for patents used to be 17 years after 
issuing, but for outstandning patents as of June 8, 1995 (the day the United States ratified the GATT 
patent treaty), the expiration date is 17 years after the date of issue or 20 years after the date of filing, 
whichever is later. Today, the expiration date for U.S. patents is 20 years from filing, pursuant to the 


international standard. 


DES U.S. Patent: 3,962,539 
Filed: February 24, 1975 
Inventors: Ehrsam et al. Issued: Jane 8, 1976 


Assignee: IBM 


This patent covered the DES cipher and was placed in the public domain by IBM. It is now expired. 


Diffie-Hellman U.S. Patent: 4,200,770 
Filed: September 6, 1977 
Inventors: Hellman, Diffie, and Merkle Issued: April 29, 1980 


Assignee: Stanford University 


This is the first patent covering a public-key cryptosystem. It describes Diffie-Hellman key agreement, 
as well as a means of authentication using long-term Diffie-Hellman public keys. This patent is now 


expired. 

Public-key cryptosystems U.S. Patent: 4,218,582 
Filed: October 6, 1977 

Inventors: Hellman and Merkle Issued: August 19, 1980 


Assignee: Stanford University 


The Hellman-Merkle patent covers public-key systems based on the knapsack problem and now 
known to be insecure. Its broader claims cover general methods of public-key encryption and digital 
signatures using public keys. This patent is expired. 


RSA U.S. Patent: 4,405,829 
Filed: December 14, 1977 
Inventors: Rivest, Shamir, and Adelman Issued: September 20, 1983 


Assignee: MIT 


This patent describes the RSA public-key cryptosystem as used for both encryption and signing. It 
served as the basis for the founding of RSADSI. 


Fiat-Shamir identification U.S. Patent: 4,748,668 
Filed: July 9, 1986 
Inventors: Shamir and Fiat Issued: May 31, 1988 


Assignee: Yeda Research and Development (Israel) 


This patent describes the Fiat-Shamir identification scheme. 
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Control vectors U.S. Patent: 4,850,017 
Filed: May 29, 1987 
Inventors: Matyas, Meyer, and Brachtl Issued: July 18, 1989 


Assignee: IBM 


Patent 4,850,017 is the most prominent among a number describing the use of control vectors for 
key management. This patent describes a method enabling a description of privileges to be bound to 
a cryptographic key, serving as a deterrent to the key’s misuse. 


GQ identification U.S. Patent: 5,140,634 
Filed: October 9, 1991 
Inventors: Guillou and Quisquater Issued: August 18, 1992 


Assignee: U.S. Phillips Corporation 


This patent describes the GQ identification scheme. 


IDEA U.S. Patent: 5,214,703 
Filed: January 7, 1992 
Inventors: Lai and Massey Issued: May 25, 1993 


Assignee: Ascom Tech AG (Switzerland) 


Patent 5,214,703 covers the IDEA block cipher, an alternative to DES that employs 128-bit keys. 


DSA U.S. Patent: 5,231,668 
Filed: July 26, 1991 
Inventor: Kravitz Issued: July 27, 1993 


Assignee: United States of America 


This patent covers the Digital Signature Algorithm (DSA), the algorithm specified in the Digital 
Signature Standard (DSS) of the U.S. National Institute of Standards (NIST). 


Fair cryptosystems U.S. Patent: 5,315,658 
Filed: April 19, 1993 
Inventor: Micali Issued: May 24, 1994 


Assignee: none 


This patent covers systems in which keys are held in escrow among multiple trustees, only a specified 


quorum of which can reconstruct these keys. 


6.4 UNITED STATES CRYPTOGRAPHY EXPORT/IMPORT LAWS 


We remind the reader of the Legal Disclaimer in Section 6.1] For correct and updated information 
on United States cryptography export/import laws, contact the Bureau of Export Administration 
(BXA) ( ). 


For many years, the U.S. government did not approve export of cryptographic products unless the 
key size was strictly limited. For this reason, cryptographic products were divided into two classes: 
products with “strong” cryptography and products with “weak” (that is, exportable) cryptography. 
Weak cryptography generally means a key size of at most 56 bits in symmetric algorithms, an RSA 
modulus of size at most 512 bits, and an elliptic curve key size of at most 112 bits (see Question 6.5.3). 
It should be noted that 56-bit DES and RC5 keys have been cracked (see Question 2.4.4), as well as 
a 512-bit RSA key (see Question 2.3.0). 


In January 2000, the restrictions on export regulations were dramatically relaxed. Today, any 
cryptographic product is exportable under a license exception (that is, without a license) unless the 
end-users are foreign governments or embargoed destinations (Cuba, Iran, Iraq, Libya, North Korea, 
Serbia, Sudan, Syria, and Talisman-controlled areas of Afghanistan as of January 2000). Export to 
government end-users may also be approved, but under a license. 
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6.4.1 Can the RSA algorithm be exported from the United States? 


Export of the RSA algorithm falls under the same U.S. laws as all other cryptographic products (see 
the beginning of Section [6.4). 


Earlier, the RSA algorithm used for authentication was more easily exported than RSA used for 
ptivacy. In the former case, export was allowed regardless of key (modulus) size, although the 
exporter had to demonstrate that the product could not be easily converted to use the RSA algorithm 
for encryption. The RSA algorithm for export was generally limited to 512 bits for key management 
purposes, while the use of RSA for data encryption was generally prohibited. 


Regardless of U.S. export policy, RSA has been available abroad in non-U.S. products for several 


years. 


6.4.2 Can DES be exported from the United States? 


For a number of years, the government rarely approved the export of DES for use outside of 
the financial sector or by foreign subsidiaries of U.S. companies. Some years ago, export policy 
was liberalized to permit unrestricted exportation of DES to companies that demonstrate plans to 
implement key recovery systems in a few years. Today, export of DES is decontrolled in accordance 


with the Wassenaar Arrangement. 


Triple-DES is exportable under the regulations described in the beginning of Section 6-4. 
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6.4.3 Why is cryptography export-controlled ? 


Cryptography is export-controlled for several reasons. Strong cryptography can be used for criminal 
purposes or even as a weapon of war. During wartime, the ability to intercept and decipher enemy 
communications is crucial. For that reason, cryptographic technologies are subject to export controls. 


In accordance with the Wassenaar Arrangement (see Question [6.5.3), U.S. government agencies 
consider strong encryption to be systems that use RSA with key sizes over 512 bits or symmetric 
algorithms (such as triple-DES, IDEA, or RC5) with key sizes over 56 bits. Since government 
encryption policy is heavily influenced by the agencies responsible for gathering domestic and 
international intelligence (the FBI and NSA, respectively) the government is compelled to balance 
the conflicting requirements of making strong cryptography available for commercial purposes while 
still making it possible for those agencies to break the codes, if need be. As already mentioned several 
times in this section, the major restrictions on export regulations were eliminated in the beginning of 
the year 2000. 


To most cryptographers, the above level of cryptography -- 512 for RSA and 56 for symmetric 
algorithms -- is not considered “strong” at all. In fact, it is worth noting that RSA Laboratories has 
considered this level of cryptography to be commercially inadequate for several years. 


Government agencies often prefer to use the terms “strategic” and “standard” to differentiate 
encryption systems. “Standard” refers to algorithms that have been drafted and selected as a federal 
standard; DES is the primary example. The government defines “strategic” as any algorithm that 
requires “excessive work factors” to successfully attack. Unfortunately, the government rarely 
publishes criteria for what it defines as “acceptable” or “excessive” work factors. 


6.4.4 Are digital signature applications exportable from the United States? 


Digital signature applications are one of the nine special categories of cryptography that automatically 
fall under the more relaxed Commerce regulations; digital signature implementations using RSA key 
sizes in excess of 512 bits were exportable even before the year 2000. However, there were some 
restrictions when developing a digital signature application using a reversible algorithm (that is, the 
signing operation is sort of the reverse operation for encryption), such as RSA. In this case, the 
application should sign a hash of the message, not the message itself. Otherwise, the message had to 
be transmitted with the signature appended. If the message was not transmitted with the signature, 
the NSA considered this quasi-encryption and the State controls would apply. 
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6.5 CRYPTOGRAPHY EXPORT/IMPORT LAWS IN OTHER COUNTRIES 
6.5.1 What are the cryptographic policies of some countries? 


This section gives a very brief description of the cryptographic policies in twelve countries. We 
emphasize that the laws and regulations are continuously changing, and the information given here 
is not necessarily complete or accurate. For example, export regulations in several countries are 
likely to change in the near future in accordance with the new U.S. policy. Moreover, some countries 
might have different policies for tangible and intangible products; intangible products are products 
that can be downloaded from the Internet. Please consult with export agencies or legal firms with 
multi-national experience in order to comply with all applicable regulations. 


Australia The Australian government has been critized for its lack of coordination in establishing 
a policy concerning export, import, and domestic use of cryptographic products. Recent clarifications 
state that there are no restrictions on import and domestic use, but that export is controlled by the 
Department of Defense in accordance with the Wassenaar Arrangement. 


Brazil While there are no restrictions of any kind today, there are proposals for a new law requiring 
users to register their products. Brazil is not part of the Wassenaar Arrangement. 


Canada There are no restrictions on import and domestic use of encryption products in Canada 
today . The Canadian export policy is in accordance with the policies of countries such as 
United States, United Kingdom, and Australia in the sense that Canada’s Communications Security 
Establishment (CSE) cooperates with the corresponding authorities in the mentioned countries. 


China China is one of the countries with the strongest restrictions on cryptography; a license 
is required for export, import, or domestic use of any cryptography product. There are several 
restrictions on export regulations, and China is not participating in the Wassenaar Arrangement. 


The European Union The European Union strongly supports the legal use of cryptography and 
is at the forefront of counteracting restrictions on cryptography as well as key escrow and recovery 
schemes. While this policy is heavily encouraged by Germany, there are a variety of more restrictive 
policies among the other member states. 


e France France used to have strong restrictions on import and domestic use of encryption 
products, but the most substantial restrictions were abolished in early 1999. Export regulations 
are pursuant to the Wassenaar Arrangement and controlled by Service Central de la Securite 
des Systemes d’Information (SCSSIJ). 


e Germany There are no restrictions on the import or use of any encryption software or 
hardware. Furthermore, the restrictions on export regulations were removed in June 1999. 


e Italy While unhindered use of cryptography is supported by the Italian authorities, there have 
been proposals for cryptography controls. There are no import restrictions, but export is 
controlled in accordance with the Wassenaar Arrangement by the Ministry of Foreign Trade. 


e United Kingdom The policy of United Kingdom is similar to that of Italy, but with even 
more outspoken proposals for new domestic cryptography controls. Export is controlled by 
the Department of Trade and Industry. 


Israel Domestic use, export, and import of cryptographic products are tightly controlled in Israel. 
There have been proposals for slight relaxations of the regulations, but only for cryptographic 
products used for authentication purposes. 


Japan ‘There ate no restrictions on the import or use of encryption products. Export is controlled 
in accordance with the Wassenaar Arrangement by the Security Export Control Division of the 
Ministry of International Trade and Industry. 


Russia The Russian policy is similar to the policies of China and Israel with licenses requited 
for import and domestic use of encryption products. Unlike those countries, however, Russia is a 
participant of the Wassenaar Arrangement. Export of cryptographic products from Russia generally 
requires a license. 


South Africa There are no restrictions on the domestic use of cryptography, but import of 
ctyptographic products requires a valid permit from the Armaments Control Division. Export is 
controlled by the Department of Defense Armaments Development and Protection. South Africa 
does not participate in the Wassenaar Arrangement. 


In the table below, 75 countries have been divided into five categories according to their cryptographic 
policies as of 1999. Category 1 includes countries with a policy allowing for unrestricted use of 
ctyptography, while category 5 consists of countries where cryptography is tightly controlled. The 
table and most other facts in this answer are collected from [IEPIC99], which includes extensive lists 
of references. Countries with their names in 7talics are participants in the Wassenaar Arrangement 


(see Question (6.5.3). 


1 | Canada, Chile, Croatia, Cyprus, Dominica, Estonia, Germany, Iceland, Indonesia, 
Ireland, Kuwait, Kreystan, Latvia, Lebanon, Lithuania, Mexico, Morocco, 

Papua New Guinea, Philippines, Slovenia, Sri Lanka, Switzerland, Tanzania, Tonga, 
Uganda, United Arab Emirates. 

2 | Argentina, Armenia, Australia, Austria, Belgium, Brazil, Bulgaria, Czech Republic, 
Denmark, Finland, France, Greece, Hungary, Italy, Japan, Kenya, South Korea, 
Luxembourg, Netherlands, New Zeeland, Norway, Poland, Portugal, Romania, 

South Africa, Sweden, Taiwan, Turkey, Ukraine, Uruguay. 

3 | Hong Kong, Malaysia, Slovakia, Spain, United Kingdom, United States. 


4 | India, Israel, Saudi Arabia. 


5 | Belarus, China, Kazakhstan, Mongolia, Pakistan, Russia, Singapore, Tunisia, 
Venezuela, Vietnam. 
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6.5.2 Why do some countries have import restrictions on cryptography? 


As indicated in the answer to Question several countries including China, Israel, and Russia 
have import restrictions on cryptography. Some countries require vendors to obtain a license 
before importing cryptographic products. Many governments use such import licenses to pursue 
domestic policy goals. In some instances, governments require foreign vendors to provide technical 
information to obtain an import license. This information is then used to steer business toward 
local companies. Other governments have been accused of using this same information for outright 
industrial espionage. 


6.5.3 Whatis the Wassenaar Arrangement? 


The Wassenaar Arrangement (WA) was founded in 1996 by a group of 33 countries including United 
States, Russia, Japan, Australia, and the members of the European Union. Its purpose is to control 
exports of conventional weapons and sensitive dual-use technology, which includes cryptographic 
products; “dual-use” means that a product can be used for both commercial and military purposes. 
The Wassenaar Arrangement controls do not apply to so-called intangible products, which include 
downloads from the Internet. 


WA is the successor of the former Coordinating Committee on Multilateral Export Controls 
(COCOM), which placed export restrictions to communist countries. It should be emphasized that 
WA is not a treaty or a law; the WA Control lists are merely guidelines and recommendations, 
and each participating state may adjust its export policy through new regulations. Indeed, there are 
substantial differences between the export regulation policies of the participating countries. 


As of the latest revision in December 1999, WA controls encryption and key management products 
where the security is based on one or several of the following: 


e A symmetric algorithm with a key size exceeding 56 bits. 
e Factorization of an integer of size exceeding 512 bits. 


e Computation of discrete logarithms in a multiplicative group of a field of size is excess of 512 
bits. 


e Computation of discrete logarithms in a group that is not part of a field, where the size of the 
group exceeds 112 bits. 


Other products, including products based on single-DES, are decontrolled. For more information 


on the Wassenaar Arrangement, see [http: //www.wassenaar.org/]. 
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CHAPTER 7 
Miscellaneous Topics 


This chapter contains additional applications of cryptography, including new technologies and 
techniques as well as some of the more important older techniques and applications. 


7.1 Whatis probabilistic encryption? 


Probabilistic encryption, developed by Goldwasser and Micali [(3M84], is a design approach for 
encryption where a message is encrypted into one of many possible ciphertexts (not just a single 
ciphertext as in deterministic encryption). This is done in such a way that it is provably as hard to 
obtain partial information about the message from the ciphertext as it is to solve some hard problem. 
In previous approaches to encryption, even though it was not always known whether one could 
obtain such partial information, it was not proved that one could not do so. 


A particular example of probabilistic encryption given by Goldwasser and Micali operates on “bits” 
rather than “blocks” and is based on the quadratic residuosity problem. The problem is to find 
whether an integer x is a square modulo a composite integer n. (This is easy if the factors of n are 
known, but presumably hard if they are not.) In their example, a “0” bit is encrypted as a random 
square, and a “1” bit as a non-squate; thus it is as hard to decrypt as it is to solve the quadratic 
residuosity problem. The scheme has substantial message expansion due to the bit-by-bit encryption 
of the message. Blum and Goldwasser later proposed an efficient probabilistic encryption scheme 
with minimal message expansion [BG85]. 


7.2 What are special signature schemes? 


Since the time Diffie and Hellman introduced the concept of digital signatures (see Question 2.2.2), 
many signature schemes have been proposed in cryptographic literature. These schemes can be 
categorized as either conventional digital signature schemes (for example, RSA and DSA) or special 
signature schemes depending on their security features. In a conventional signature scheme (the 
original model defined by Diffie and Hellman), we generally assume the following situation: 


e The signer knows the contents of the message that he has signed. 


e Anyone who knows the public key of the signer can verify the correctness of the signature at 
any time without any consent or input from the signer. (Digital signature schemes with this 
property are called self-authenticating signature schemes.) 


e The security of the signature schemes is based on certain complexity-theoretic assumptions. 


In some situations, it may be better to relax some of these assumptions, and/or add certain special 
security features. For example, when Alice asks Bob to sign a certain message, she may not want 
him to know the contents of the message. In the past decade, a variety of special signature schemes 
have been developed to fit other security needs that might be desired in different applications. 
Questions [7.3] through [7.8] deal with some of these special signature schemes. 
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7.3 Whatis a blind signature scheme? 


Blind signature schemes, first introduced by Chaum [Cha83] [Cha85], allow a person to get a message 
signed by another party without revealing any information about the message to the other party. 


Using RSA signatures (see Question B.1.1), Chaum demonstrated the implementation of this concept 
as follows: Suppose Alice has a message m that she wishes to have signed by Bob, and she does not 
want Bob to learn anything about m. Let (n,e) be Bob's public key and (n, d) be his private key. 
Alice generates a random value r such that gcd(r, n) = 1 and sends ñ = (r°m) mod n to Bob. The 
value m is “blinded” by the random value r; hence Bob can derive no useful information from it. 
Bob returns the signed value § = M* mod n to Alice. Since 


Mİ = (rem) =rm* (mod n), 


Alice can obtain the true signature s of m by computing s = r~'§ mod n. 


Now Alice’s message has a signature she could not have obtained on her own. This signature scheme 
is secure provided that factoring and root extraction remains difficult. However, regardless of the 
status of these problems the signature scheme is unconditionally “blind” since r is random. The 
random r does not allow the signer to learn about the message even if the signer can solve the 
underlying hard problems. 


There are potential problems if Alice can give an arbitrary message to be signed, since this effectively 
enables her to mount a chosen message attack. One way of thwarting this kind of attack is described 


in [CEN88]. 


Blind signatures have numerous uses including timestamping (see Question [7.1T), anonymous access 
control, and digital cash (see Question K.2.T). Thus it is not surprising there are now numerous 
variations on the blind signature theme. Further work on blind signatures has been carried out in 


recent years [EY94] [SPC95]. 


7.4 Whatis a designated confirmer signature? 


A designated confirmer signature [Cha94] strikes a balance between self-authenticating digital 
signatures (see Question and zero-knowledge proofs (see Question 2.1.8). While the former 
allows anybody to verify a signature, the latter can only convince one recipient at a time of the 
authenticity of a given document, and only through interaction with the signer. A designated 
confirmer signature allows certain designated parties to confirm the authenticity of a document 
without the need for the signer’s input. At the same time, without the aid of either the signer or the 
designated parties, it is not possible to verify the authenticity of a given document. Chaum developed 
implementations of designated confirmer signatures with one or more confirmers using RSA digital 


signatures (see Question B.T.T). 
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7.5 Whatis a fail-stop signature scheme? 


A fail-stop signature scheme is a type of signature devised by van Heyst and Pederson [WPY2] to 
protect against the possibility that an enemy may be able to forge a person’s signature. It is a variation 
of the one-time signature scheme (see Question [7.7), in which only a single message can be signed 
and protected by a given key at a time. The scheme is based on the discrete logarithm problem. In 
particular, if an enemy can forge a signature, then the actual signer can prove that forgery has taken 
place by demonstrating the solution of a supposedly hard problem. Thus the forger’s ability to solve 
that problem is transferred to the actual signet. 


The term “fail-stop” refers to the fact that a signer can detect and stop failures, that is, forgeries. 
Note that if the enemy obtains an actual copy of the signer’s private key, forgery cannot be detected. 
What the scheme detects are forgeries based on cryptanalysis. 


7.6 Whatis a group signature? 


A group signature, introduced by Chaum and van Heijst [UV9T], allows any member of a group to 
digitally sign a document in a manner such that a verifier can confirm that it came from the group, 
but does not know which individual in the group signed the document. The protocol allows for 
the identity of the signer to be discovered, in case of disputes, by a designated group authority that 
has some auxiliary information. Unfortunately, each time a member of the group signs a document, 
a new key pair has to be generated for the signer. The generation of new key pairs causes the 
length of both the group members’ secret keys and the designated authority’s auxiliary information to 
grow. This tends to cause the scheme to become unwieldy when used by a group to sign numerous 
messages or when used for an extended period of time. Some improvements [CP94] [CP95] have 
been made in the efficiency of this scheme. 
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7.7 Whatis a one-time signature scheme? 


A one-time signature scheme allows the signature of only a single message using a given piece of 
private (and public) information. One advantage of such a scheme is that 1t is generally quite fast. 
However, the scheme tends to be unwieldy when used to authenticate multiple messages because 
additional data needs to be generated to both sign and verify each new message. By contrast, with 
conventional signature schemes like RSA (see Question B.T.T), the same key pair can be used to 
authenticate multiple documents. There is a relatively efficient implementation of one-time-like 
signatures by Merkle called the Merkle Tree Signature Scheme (see Question B.6.9), which does not 
require new key pairs for each message. 


7.8 Whatis an undeniable signature scheme? 


Undeniable signature scheme, devised by Chaum and van Antwerpen [VIO] [UV 92], are non-self- 
authenticating signature schemes (see Question [7.2), where signatures can only be verified with the 
signer’s consent. However, if a signature is only verifiable with the aid of a signer, a dishonest signet 
may refuse to authenticate a genuine document. Undeniable signatures solve this problem by adding 
a new component called the disavowal protocol in addition to the normal components of signature 
and verification. 


The scheme is implemented using public-key cryptography based on the discrete logarithm problem 
(see Question 2.3.7). The signature part of the scheme is similar to other discrete logarithm signature 
schemes. Verification is carried out by a challenge-response protocol where the verifier, Alice, sends 
a challenge to the signer, Bob, and views the answer to verify the signature. The disavowal process is 
similar; Alice sends a challenge and Bob’s response shows that a signature is not his. (If Bob does 
not take part, it may be assumed that the document is authentic.) The probability that a dishonest 
signer is able to successfully mislead the verifier in either verification or disavowal is 1/p where p is 
the prime number in the signer’s private key. If we consider the average 768-bit private key, there is 
only a minuscule probability that the signer will be able to repudiate a document they have signed. 
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7.9 What are on-line/off-line signatures? 


On-line/off-line signature schemes ate a way of getting around the fact that many general-purpose 
digital signature schemes have high computational requirements. On-line/off-line schemes are 
created by joining together a general-purpose signature scheme (see Question and a one-time 
signature scheme (see Question [/.7} in such a way that the bulk of the computational burden for a 
signature operation can be performed before the signer knows the message that will be signed. 


More precisely, let a general-purpose digital signature scheme and a one-time signature scheme be 
fixed. These schemes can be used together to define an on-line/off-line signature scheme which 
works as follows: 


1. Key pair generation. A public/private key pair Kp/Ks for the general-purpose signature scheme 
is generated. These are the public and private keys for the on-line/off-line scheme as well. 


2. Offline phase of signing. A public/private key pair Tp/Ts for the one-time signature scheme is 
generated. The public key Tp for the one-time scheme is signed with the private key Ks for 
the general-purpose scheme to produce a signature Sx (Tp). 


3. On-line phase of signing. To sign a message m, use the one-time scheme to sign m with the private 
key Ts, computing the value Sy (m). The signature of m is then the triple (Tp, Sx (Tp), Sr(m)). 


Note that steps 2 and B] must be performed for each message signed; however, the point of using 
an on-line/off-line scheme is that step P] can be performed before the message m has been chosen 
and made available to the signer. An on-line/off-line signature scheme can use a one-time signature 
scheme that is much faster than a general-purpose signature scheme, and this can make digital 
signatures much mote practical in a variety of scenarios. An on-line/off-line signature scheme can 
be viewed as the digital signature analog of a digital envelope (see Question 2.2.4). 


For more information about on-line/offline signatures, see [EGM89}. 


7.10 Whatis OAEP? 


Optimal Asymmetric Encryption Padding (OAEP) is a method for encoding messages developed by 
Mihir Bellare and Phil Rogaway [BR94]. The technique of encoding a message with OAEP and then 
encrypting it with RSA is provably secure in the random oracle model. Informally, this means that if 
hash functions are truly random, then an adversary who can recover such a message must be able to 
break RSA. 


An OAEP encoded message consists of a “masked data” string concatenated with a “masked random 
number”. In the simplest form of OAEP, the masked data is formed by taking the XOR of the 
plaintext message M and the hash G of a random string r. The masked random number is the XOR 
of r with the hash H of the masked data. The input to the RSA encryption function is then 


[M e G(r)] || [ro H(M e G(r))] 


Often, OAEP is used to encode small items such as keys. There are other variations on OAEP 
(differing only slightly from the above) that include a feature called “plaintext-awareness”. This 
means that to construct a valid OAEP encoded message, an adversary must know the original 
plaintext. To accomplish this, the plaintext message M is first padded (for example, with a string of 
zeroes) before the masked data is formed. OAEP is supported in the ANSI X9.44, IEEE P1363 and 
SET standards. 


Frequently Asked Questions About Today's Cryptography / Chapter 7 


7.11 Whatis digital timestamping? 


Consider two questions that may be asked by a computer user as he or she views a digital document 
or on-line record: 


(1) Who is the author of this record -- who wrote it, approved it, or consented to it? 
(2) When was this record created or last modified? 


In both cases, the question is about exactly this record -- exactly this sequence of bits. An answer to 
the first question tells who and what: Who approved exactly what is in this record? An answer to the 
second question tells when and what: When exactly did the contents of this record first exist? 


Both of the above questions have good solutions. A system for answering the first question is called 
a digital signature scheme (see Question 2.2.2). A system for answering the second question is called 
a digital timestamping scheme. Such systems are described in [BHS93] and [[HS91]. 


Any system allowing users to answer these questions must include two procedures. First, there 
must be a signing procedure with which (1) the author of a record can “sign” the record, or (2) any 
user can fix a record in time. The result of this procedure is a string of bytes that serves as the 
signature. Second, there must be a verification procedure by which any user can check a record and 
its purported signature to make sure it correctly answers (1) who and what? or (2) when and what? 


about the record in question. 


The signing procedure of a digital timestamping system often works by mathematically linking the bits 
of the record to a “summary number” that is widely witnessed by and widely available to members 
of the public -- including, of course, users of the system. The computational methods employed 
ensure that only the record in question can be linked, according to the “instructions” contained in 
its timestamp certificate, to this widely witnessed summary number; this is how the particular record 
is tied to a particular moment in time. The verification procedure takes a particular record and a 
putative timestamp certificate for that record and a particular time, and uses this information to 
validate whether that record was indeed certified at the time claimed by checking it against the widely 
available summary number for that moment. 


One nice thing about digital timestamps is that the document being timestamped does not have to 
be released to anybody to create a timestamp. The originator of the document computes the hash 
values himself, and sends them in to the timestamping service. The document itself is only needed 
for verifying the timestamp. This is very useful for many reasons (like protecting something that you 
might want to patent). 


Two features of a digital timestamping system are particularly helpful in enhancing the integrity of a 
digital signature system. First, a timestamping system cannot be compromised by the disclosure of a 
key. This is because digital timestamping systems do not rely on keys, or any other secret information, 
for that matter. Second, following the technique introduced in [BHS93}, digital timestamp certificates 
can be renewed so as to remain valid indefinitely. 


With these features in mind, consider the following situations. 


It sometimes happens that the connection between a person and his or her public signature key must 
be revoked. For example, the user's private key may accidentally be compromised, or the key may 
belong to a job or role in an organization that the person no longer holds. Therefore the person-key 
connection must have time limits, and the signature verification procedure should check that the 
record was signed at a time when the signer’s public key was indeed in effect. And thus when a 
user signs a record that may be checked some time later -- perhaps after the user's key is no longer 
in effect -- the combination of the record and its signature should be certified with a secure digital 
timestamping service. 


There is another situation in which a user's public key may be revoked. Consider the case of the signer 
of a particularly important document who later wishes to repudiate his signature. By dishonestly 
reporting the compromise of his private key, so that all his signatures are called into question, the 
user is able to disavow the signature he regrets. However, if the document in question was digitally 
timestamped together with its signature (and key-revocation reports are timestamped as well), then 
the signature cannot be disavowed in this way. This is the recommended procedure, therefore, in 
order to preserve the non-reputability desired of digital signatures for important documents. 


The statement that private keys cannot be derived from public keys is an over-simplification of a 
more complicated situation. In fact, this claim depends on the computational difficulty of certain 
mathematical problems. As the state of the art advances -- both the current state of algorithmic 
knowledge, as well as the computational speed and memory available in currently available computers 
-- the maintainers of a digital signature system will have to make sure that signers use longer and 
longer keys. But what is to become of documents that were signed using key lengths that are no 
longer considered secure? If the signed document is digitally timestamped, then its integrity can be 
maintained even after a particular key length is no longer considered secure. 

Of course, digital timestamp certificates also depend for their security on the difficulty of certain 
computational tasks concerned with hash functions (see Question 2.1.6). (All practical digital signature 
systems depend on these functions as well.) The maintainers of a secure digital timestamping service 
will have to remain abreast of the state of the art in building and in attacking one-way hash functions. 
Over time, they will need to upgrade their implementation of these functions, as part of the process 
of renewal [BHS93]. This will allow timestamp certificates to remain valid indefinitely. 
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7.12 Whatis key recovery? 


One of the barriers to the widespread use of encryption in certain contexts is the fact that when a 
key is somehow “lost”, any data encrypted with that key becomes unusable. Key recovery is a general 
term encompassing the numerous ways of permitting “emergency access” to encrypted data. 


One common way to perform key recovery, called key escrow, is to split a decryption key (typically 
a secret key or an RSA private key) into one or several parts and distribute these parts to escrow 
agents or “trustees”. In an emergency situation (exactly what defines an “emergency situation” is 
context-dependent), these trustees can use their “shares” of the keys either to reconstruct the missing 
key or simply to decrypt encrypted communications directly. This method was used by Security 
Dynamics’ RSA SecurPC product. 


Another recovery method, called key encapsulation, is to encrypt data in a communication with a 
“session key” (which varies from communication to communication) and to encrypt that session key 
with a trustee’s public key. The encrypted session key is sent with the encrypted communication, and 
so the trustee is able to decrypt the communication when necessary. A vatiant of this method, in 
which the session key is split into several pieces, each encrypted with a different trustee’s public key, 
is used by TIS’ RecoverKey. 


Dorothy Denning and Dennis Branstad have written a survey of key recovery methods [IDB99]. 


Key recovery first gained notoriety as a potential work-around to the United States Government’s 
policies on exporting “strong” cryptography. To make a long story short, the Government agreed to 
permit the export of systems employing strong cryptography as long as a key recovery method that 
permits the Government to read encrypted communications (under appropriate circumstances) was 
incorporated. For the Government’s purposes, then, “emergency access” can be viewed as a way of 
ensuring that the Government has access to the plaintext of communications it is interested in, rather 
than as a way of ensuring that communications can be decrypted even if the required key is lost. 


Key recovery can also be performed on keys other than decryption keys. For example, a user's 
private signing key might be recovered. From a security point of view, however, the rationale for 
recovering a signing key is generally less compelling than that for recovering a decryption key; the 
recovery of a signing key by a third party might nullify non-repudiation. 


7.13 Whatare LEAFs? 


A LEAF, or Law Enforcement Access Field, is a small piece of “extra” cryptographic information 
that is sent or stored with an encrypted communication to ensure that appropriate Government 
entities, or other authorized parties, can obtain the plaintext of some communication. For a typical 
escrowed communication system, a LEAF might be constructed by taking the decryption key for 
the communication, splitting it into several shares, encrypting each share with a different key escrow 
agent's public key, and concatenating the encrypted shares together. 


The term “LEAF” originated with the Clipper Chip (see Question for more information). 
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7.14 Whatis PSS/PSS-R? 


PSS (Probabilistic Signature Scheme) is a provably secure way of creating signatures with RSA (see 
Question due to Mihir Bellare and Phillip Rogaway [BRY90]. Informally, a digital signature 
scheme is provably secure if its security can be tied closely to that of an underlying cryptographic 
ptimitive. The proof of security for PSS takes place in the random oracle model, in which hash 
functions are modeled as being truly random functions. Although this model is not realistically 
attainable, there is evidence that practical instantiations of provably secure schemes are still better 
than schemes without provable security [BR93]. The method for creating digital signatures with RSA 
that is described in PKCS #1 (see Question 5.3.3) has not been proven secure even if the underlying 
RSA primitive is secure; in contrast, PSS uses hashing in a sophisticated way to tie the security of the 
signature scheme to the security of RSA. 


To minimize the length of communications, it is often desirable to have signature schemes in which 
the message can be “folded” into the signature. Schemes that accomplish this are called message 
recovery signature schemes. PSS-R is a message recovery variant of PSS with the same provable 
security. 


Standards efforts related to PSS and PSS-R are underway in several forums, including ANSI X9F1, 
IEEE P1363, ISO/IEC JTC1 SC27, and PKCS. 


7.15 What are covert channels? 


Covert communication channels (also called subliminal channels) are often motivated as being 
solutions to the “prisoners’ problem.” Consider two prisoners in separate cells who want to exchange 
messages, but must do so through the warden, who demands full view of the messages (that is, 
no encryption). A covert channel enables the prisoners to exchange secret information through 
messages that appear to be innocuous. A covert channel requires prior agreement on the part of 
the prisoners. For example if an odd length word corresponds to “1” and an even length word 
corresponds to “0”, then the previous sentence contains the subliminal message “101011010011”. 


An important use of covert channels is in digital signatures. If such signatures are used, a prisoner 
can both authenticate the message and extract the subliminal message. Gustavus Simmons [Sim93all 
devised a way to embed a subliminal channel in DSA (see Section B.4) that uses all of the available 
bits (that is, those not being used for the security of the signature), but requires the recipient to have 
the sender’s secret key. Such a scheme is called broadband and has the drawback that the recipient is 
able to forge the sendet’s signature. Simmons [Sim93b] also devised schemes that use fewer of the 
available bits for a subliminal channel (called narrowband schemes) but do not require the recipient 


to have the sendet’s secret key. 
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7.16 Whatare proactive security techniques? 


Proactive security combines the ideas of distributed cryptography (also called secret sharing) (see 
Question 2.1.9). with the refreshment of secrets. The term proactive refers to the fact that it’s 
not necessaty for a breach of security to occur before secrets are refreshed, the refreshment is 
done periodically (and hence, proactively). Key refreshment is an important addition to distributed 
ctyptography because without it, an adversary who is able to recover all the distributed secrets given 
enough time will eventually be successful in breaking the system. For example, consider the following 
proactive version of Shamir's secret sharing scheme (see Question B.6.12): 


A polynomial 


folt) = ao + a£ +--*+am rr? 


over GF(q) is constructed, and the secret is ag. From the beginning, each user has a point 
(xi, fo(xi)) with z; 4 0. For the first key refreshment, a new polynomial fı is constructed from 
fo. More generally, for the kth key refreshment, a polynomial fk+ı is constructed from fp. 
The polynomial fk+ı is equal to fk + gk, where gp is a random (m — 1)-degree polynomial with 
gk(0) = 0. After each key refreshment the secret is unchanged, but user is new secret share is 
(£i, fr+1(£1)) = (zi, fk(xi) + gx (avi)). An adversary who knows less than m current secret shares at 
any particular time knows nothing about the secret. 


More recent techniques in proactive security include proactive RSA [EGM97] and proactive signatures 


(see [GJK9G] and [HJJ97]). For an overview of proactive techniques see [(CGHY7]. 


7.17 Whatis quantum computing? 


Quantum computing [Ben82] [Fey82] [Fey86] [DeuYA] is a new field in computer science that has 


been developed with our increased understanding of quantum mechanics. It holds the key to 
computers that are exponentially faster than conventional computers (for certain problems). A 
quantum computer is based on the idea of a quantum bit or qubit. In classical computers, a bit 
has a discrete range and can represent either a zero state or a one state. A qubit can be in a linear 
superposition of the two states. Hence, when a qubit is measured the result will be zero with a certain 
probability and one with the complementary probability. A quantum register consists of n qubits. 
Because of superposition, a phenomenon known as quantum parallelism allows exponentially many 
computations to take place simultaneously, thus vastly increasing the speed of computation. 


Quantum interference, the analog of Young’s double-slit experiment that demonstrated constructive 
and destructive interference phenomena of light, is one of the most significant characteristics of 
quantum computing. Quantum interference improves the probability of obtaining a desired result 
by constructive interference and diminishes the probability of obtaining an erroneous result by 
destructive interference. Thus, among the exponentially many computations, the correct answer can 
theoretically be identified with appropriate quantum “algorithms.” 


It has been proven [Sho94] that a quantum computer will be able to factor (see Question 2.3.3) and 
compute discrete logarithms (see Question 2.3.7) in polynomial time. Unfortunately, the development 
of a practical quantum computer still seems far away because of a phenomenon called quantum 
decoherence, which is due to the influence of the outside environment on the quantum computer. 


Brassard has written a number of helpful texts in this field [Bra95al] [Bra95b] (Bra95c). 


Quantum cryptography (see Question is quite different from, and currently more viable than, 
quantum computing. 
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7.18 Whatis quantum cryptography? 


Quantum cryptography [BBBYA] [Bra93] is a method for secure key exchange over an insecure 
channel based on the nature of photons. Photons have a polarization, which can be measured in any 


basis, where a basis consists of two directions orthogonal to each other, as shown in Figure [7.1] 


rectilinear diagonal 
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Figure 7.1: Bases. 


If a photon’s polarization is read in the same basis twice, the polarization will be read correctly and 
will remain unchanged. If it is read in two different bases, a random answer will be obtained in 
the second basis, and the polarization in the initial basis will be changed randomly, as shown in 


Figure [7.4 
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Figure 7.2: Polarization readings. 


The following protocol can be used by Alice and Bob to exchange secret keys. 


e Alice sends Bob a stream of photons, each with a random polarization, in a random basis. She 
recotds the polarizations. 


e Bob measures each photon in a randomly chosen basis and records the results. 


Bob announces, over an authenticated but not necessarily private channel (for example, by 
telephone), which basis he used for each photon. 


Alice tells him which choices of bases are correct. 


e The shared secret key consists of the polarization readings in the correctly chosen bases. 


Quantum cryptography has a special defense against eavesdropping: If an enemy measures the 
photons during transmission, he will use the wrong basis half the time, and thus will change some of 
the polarizations. That will result in Alice and Bob having different values for their secret keys. As a 
check, they can exchange some random bits of their key using an authenticated channel. They will 
therefore detect the presence of eavesdropping, and can start the protocol over. 


There has been experimental work in developing such systems by IBM and British Telecom. 
For information on quantum computing (which is not the same as quantum cryptography), see 


Question [7.T7, 
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7.19 Whatis DNA computing? 


DNA computing, also known as molecular computing, is a new approach to massively parallel 
computation based on groundbreaking work by Adleman. He used DNA to solve a seven-node 
Hamiltonian path problem, a special case of an NP-complete problem that attempts to visit every 
node in a graph exactly once. (This special case is trivial to solve with a conventional computer, or 
even by hand, but illustrates the potential of DNA computing.) 


A DNA computer is basically a collection of specially selected DNA strands whose combinations 
will result in the solution to some problem. Technology is currently available both to select the initial 
strands and to filter the final solution. The promise of DNA computing is massive parallelism: with 
a given setup and enough DNA, one can potentially solve huge problems by parallel search. This 
can be much faster than a conventional computer, for which massive parallelism would require large 
amounts of hardware, not simply more DNA. 


Research on DNA computing is ongoing; Lipton and Adleman [AdI95] have extended on 
Adleman’s original work with more efficient designs of possible DNA computers. 


The impact of DNA computing on cryptography remains to be determined. Beaver [Bea95] has 
estimated that to factor a 1000-bit number following Adleman’s original approach, the required 
amount of solution would be 10290000 liters. However, Adleman has observed that a DNA computer 
sufficient to search for 2% DES keys would occupy only a small set of test tubes [[AdI9G]. In any 
case, DNA computing is just classical computing, albeit highly parallelized; thus with a large enough 
key, one should be able to thwart any DNA computer that can be built. With quantum computing 
(see Question [7.17), on the other hand, factoring can theoretically be done in (quantum) polynomial 
time, so quantum computing might be viewed with more concern than DNA computing. 


7.20 Whatare biometric techniques? 


The term biometrics applies to a broad range of electronic techniques that employ the physical 
characteristics of human beings as a means of authentication. In a sense, human beings already 
routinely authenticate one another biometrically: confirming the identity of a friend on the telephone 
by the sound of his or her voice is a simple instance of this. A number of biometric techniques 
have been proposed for use with computer systems. These include (among a wide variety of others) 
fingerprint readers, iris scanners, face imaging devices, hand geometry readers, and voice readers. 
Usage of biometric authentication techniques is often recommended in conjunction with other user 
authentication methods, rather than as a single, exclusive method. 


Fingerprint readers are likely to become a common form of biometric authentication device in the 
coming years. To identify herself to a server using a fingerprint reader, a user places her finger 
on a small reading device. This device measures various characteristics of the patterns associated 
with the fingerprint of the user, and typically transmits these measurements to a server. The server 
compares the measurements taken by the reader against a registered set of measurements for the 
user. The server authenticates the user only if the two sets of measurements correspond closely to 
one another. One significant characteristic of this and other biometric technologies is that matching 
must generally be determined on an approximate basis, with parameters tuned appropriately to make 
the occurrence of false positive matches or false negative rejections acceptably infrequent. 
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7.21 Whatis tamper-resistant hardware? 


One part of designing a secure computer system is ensuring that various cryptographic keys can be 
accessed only by their intended user(s) and only for their intended purposes. Keys stored inside a 
computer can be vulnerable to use, abuse, and/or modification by an unauthorized attacker. 


Por a variety of situations, an appropriate way to protect keys is to store them in a tamper-resistant 
hardware device. These devices can be used for applications ranging from secure e-mail to electronic 
cash and credit cards. They offer physical protection to the keys residing inside them, thereby 
providing some assurance that these keys have not been maliciously read or modified. Typically, 
gaining access to the contents of a tamper-resistant device requires knowledge of a PIN or password; 
exactly what type of access can be gained with this knowledge is device-dependent. 


Some tampet-resistant devices do not permit certain keys to be exported outside the hardware. This 
can provide a very strong guarantee that these keys cannot be abused: the only way to use these 
keys is to physically possess the particular device. Of course, these devices must actually be able 
to perform cryptographic functions with their protected keys, since these keys would otherwise be 
useless. 


Tamper-ptoof devices come in a variety of forms and capabilities. One common type of device is a 
“smart catd,” which is approximately the size and shape of a credit card. To use a smart card, one 
inserts it into a smart card reader that is attached to a computer. Smart Cards are frequently used 
to hold a user's private keys for financial applications; Mondex (see Question K.2.4) is a system that 
makes use of tamper-resistant hardware in this fashion. 


7.22 How are hardware devices made tamper-resistant? 


There are many techniques that are used to make hardware tamper-resistant (see Question [7.2T). 
Some of these techniques are intended to thwart direct attempts at opening a device and reading 
information out of its memory; others offer protection against subtler attacks, such as timing attacks 
and induced hardware-fault attacks. 


At a very high level, a few of the general techniques currently in use to make devices tamper-resistant 
are: 


e Employing sensors of various types (for example, light, temperature, and resistivity sensors) in 
attempt to detect occurrences of malicious probing. 


e Packing device circuitry as densely as possible (dense circuitry makes it difficult for attackers to 
use a logic probe effectively). 


e Using error-correcting memory. 


e Making use of non-volatile memory so that the device can tell if it has been reset (or how many 
times it has been reset). 


e Using redundant processors to perform calculations, and ensuring that all the calculated 
answers agree before outputting a result. 
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CHAPTER & 
Further Reading 


This chapter lists suggestions for further information about cryptography and related issues. 


8.1 Where canllearn more about cryptography? 


There ate a number of textbooks available to the student of cryptography. Among the most useful 
are the following three. 


Applied Cryptography by B. Schneier, John Wiley & Sons, Inc., 1996. Schneier’s book is an accessible 
and practically oriented book with very broad coverage of recent and established cryptographic 
techniques. 


Handbook of Applied Cryptography by A.J. Menezes, P.C. van Oorschot, S.A. Vanstone. CRC Press, 
1996. The HAC offers a thorough treatment of cryptographic theory and protocols, with a great 
deal of detailed technical information. It is an excellent reference book, but somewhat technical, and 
not aimed to serve as an introduction to cryptography. 


Cryptography: Theory and Practice by D. R. Stinson. CRC Press, 1995. This is a textbook, and includes 
exercises. Theory comes before practice in both title and content, but the book provides a good 
introduction to the fundamentals of cryptography. 


Por additional information, or more detailed information about specific topics, the reader is referred 
to the chapter summaries and bibliographies in any one of these texts. 


8.2 Where can l learn more about recent advances in cryptography? 


There are many annual conferences devoted to cryptographic research. The proceedings from these 
conferences are excellent sources for information about recent advances. The IACR sponsors many 
of the more prominent conferences; the [ACR web site 


http://www.lacr.org 


contains information on the proceedings from the conferences Crypto, Eurocrypt, and Asiacrypt. 
Some other major cryptographic research conferences are 


e ACM Conference on Computer and Communication Security ([http:/7/www.acm.org]) 
e IEEE Symposium on Security and Privacy ([http:/7/www.ieee.org. 
e ISOC Network and Distributed System Security Symposium ([http:/7/www.isoc.org]). 
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8.3 Where can l learn more about electronic commerce? 


As electronic commerce is a very rapidly changing field, the best resources are perhaps those available 
on the World Wide Web. The following is a selection of survey sites available as of the beginning of 
1998. 


Payment mechanisms designed for the Internet: 


iWORLD’s guide to electronic commerce: 
| 
Electronic Commerce, Payment Systems, and Security: 
http: //www.semper .org/sirene/outsideworld/ecommerce.htmll. 
Electronic Payment Schemes: 


http://www.w3.org/pub/WWW/Payments/roadmap.html}. 


8.4 Where can llearn more about cryptography standards? 


Several organizations are involved in defining standards related to aspects of cryptography and its 
applications. 


ANSI 

The American National Standards Institute (ANSI) has a broadly based standards program, and 
some of the groups within its Financial Services area (Committee X9; see Question B.3.1)) establish 
standards related to cryptographic algorithms. Examples include X9.17 (key management: wholesale), 
X9.19 (message authentication: retail), and X9.30 (public-key cryptography). Information can be 


found at 
http: //www.x9.org]. 
IEEE 


The Institute of Electrical and Electronic Engineers (IEEE) has a broadly based standards program, 
including P1363 (see Question 6.3.5). Information can be found at 


http://www.ieee.org). 


IETF 

The Internet Engineering Task Force (ETF) is the defining body for Internet protocol standards. 
Its security area working groups specify means for incorporating security into the Internet’s layered 
protocols. Examples include IP layer security (IPSec; see Question 6.1.4), transport layer security 
(TLS; see Question 6.1.2), Domain Name System security (DNSsec) and Generic Security Service 
API (GSS-API; see Question B.2.2). Information can be found at 


http://www.ietí.orgl]. 


ISO and ITU 

The International Standards Organization’s International Electrotechnical Commission (ISO/IEC) 
and the International Telecommunications Union’s Telecommunication Standardization Sector (ITU- 
T; see Question have broadly-based standards programs (many of which are collaborative 
between the organizations), which include cryptographically-related activities (see Question 6.3.4 
Example results are: [TU-T Recommendation X.509, which defines facilities for public-key certifi- 
cation, and the ISO/IEC 9798 document series, which defines means for entity authentication. ITU 
information can be found at 


http://www.itu.c 


and ISO information at 


NIST 
The U.S. National Institute of Standards and Technology (NIST)’s Information Technology Labo- 
ratory produces a series of information processing specifications (Federal Information Processing 
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Standards (FIPS)), several of which ate related to cryptographic algorithms and usage. Examples 
include FIPS PUB 46-3 (Data Encryption Standard (DES)) and FIPS PUB 186 (Digital Signature 
Standard (DSS)). Information is available at 


http://www.nist.gov. 


Open Group 

The Open Group produces a range of standards, some of which are related to cryptographic 
interfaces (APIs; see Question 6.2.1) and infrastructure components. Examples include Common 
Data Security Architecture (CDSA) and Generic Crypto Service API (GCS-API). Information can 
be found at 


http://www.opengroup.org}. 


PKCS 

RSA Laboratories is responsible for the development of the Public-key cryptography Standards 
(PKCS; see Question B.3.3) series of specifications, which define common cryptographic data 
elements and structures. Information can be found at 


http://www.rsasecurity.com/rsalabs/pkcs/|. 


8.5 Where can l learn more about laws concerning cryptography? 


The best way to learn more about any specific question you might have about laws concerning 
cryptography is to consult with an attorney. Beyond that, 


WWW.€ep1Cc.con 
and 
Gum cZ mel 


are web pages of organizations devoted to following laws and legislation concerning cryptography. 
Also, any legal archive is a good source for information about laws concerning cryptography. 
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Á 
Mathematical Concepts 


The purpose of this Appendix is to give a brief description of some of the mathematical concepts 
mentioned in this document. For a more thorough treatment of modular arithmetic and basic 
number theory, consider any undergraduate textbook in elementary algebra. For more information 
about groups, tings, and fields, we recommend [Era98]. For more details on analysis and the theory 
of limits, consult any undergraduate textbook in analysis. A good introduction to complexity theory 


is given in [GJ79]. 
A.1 FUNCTIONS 


A function f from a set A to a set B assigns to each element a in A a unique element b in B. For each 
element a € A, the corresponding element in B assigned to a by f is denoted f(a); we say that a is 
mapped to f(a). The notation f : A > B means that f is a function from A to B. 


Example Consider the set Z of integers. We may define a function f : Z — Z such that f(x) = 2? 
for each x € Z. For example, f(5) = 25. 


Let f : A= Bandg: B — C be functions. The composition go f of g and f is the function h : A— C 
defined as h(a) = g(f(a)) for each a € A. Note, however, that “f o g” does not make sense unless 
A=C. 


Example Let N be the set of nonnegative numbers. With f : Z — N defined as f(x) = x? and 
9: N= Z defined as g(y) = y — y?, we obtain that go f : Z — Z is the function h defined as 


A function f : A — B is one-to-one or injective if f(a) = f(a’) implies that a = a’, that is, no two 
elements in A are mapped to the same element in B. The function f is onto or surjective if, for each 
b € B, there exists an element a € A such that f(a) = b. Finally, f is bijective if f is one-to-one and 
onto. Given a bijective function f : A — B, the inverse fT} of f is the unique function g : B > A 
with the property that go f(a) = a for all a € A. A bijective function f : A —> A is a permutation of 
the set A. 


For any subset S of A, f(S) is the set of elements b such that f(a) = b for some a € S. Note 
that f being surjective means that f(A) = B. The restriction of f to a subset S of A is the function 
f: S —> B defined as f(s) = f(s) for all s € S. 


Examples 


e The function f : Z => Z defined as f(x) = x? is injective, because x? = y? implies that z = y. 
However, f is not surjective; for example, there is no x such that f(x) = 2. 


e Let |x| be the absolute value of x € Z (for example, | — 5] = |5| = 5). The function g : Z => N 
defined as g(x) = |x| is surjective but not injective. Namely, for all x, the elements x and 
—x are mapped to the same element |x|. However, the restriction of g to N is injective and 
surjective, hence bijective. 


e If Aand B are finite sets of the same size, then a function f : A — B is injective if and only if 
f is surjective. 
A.2 MODULAR ARITHMETIC 
Given integers a, b, and n with n > 0, we say that a and b are congruent modulo n if a — b is divisible by 
n, that is, if a is an integer. We write 
a=b (mod n) 
if a and b are congruent modulo n and 
a#b (mod n) 


if they are not. Let a,b,c, and d be integers such that a = c (mod n) and b = d (mod n). It is not 
difficult to prove that 
a+b=c+d (mod n) (A.1) 


and 

ab=cd (mod n). (A.2) 
Given a fixed integer n > 0, called the modulus, we may form congruence classes of integers modulo n. 
Each congruence class is formally a set of the form 


[a] :=a4+nZ={...,a-2n,a—n,a,a+n,a+ 2n,...}. 


By (A.T) and (A.J, addition and multiplication of congruence classes is well-defined. More precisely, 
we define [a] + [b] = [a + b] and [a] - [b] = [ab]. It is convenient to identify the element [a] with 
the smallest nonnegative number a’ such that a = a’ (mod n). This number a’ will be denoted 
a mod n. For example, 13 mod 5 = 3. Let Z, denote the set of congruence classes modulo n. For 
example, Z5 = {0, 1,2,3,4}. 


The greatest common divisor (gcd) of two integers m and n is the greatest positive integer d such that d 
divides both m and n. For example, gcd(91, 52) = 13. The Euclid algorithm states that if gcd(m, n) = d, 
then there are integers r and s such that mr + ns = d. In particular, the equation 


mz=b (mod n) => mz =b (in Zn) (A.3) 


has a solution x if and only if b is divisible by d. 


Let Z;, be the set of integers (congruence classes modulo n) k € {1,...,n — 1} with the property 
that gcd(k, n) = 1. For example, Zí, = {1,5,7, 11}. 
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A.3 GROUPS 


Consider a prime number p. The procedures of adding elements in Z, and multiplying elements in 
Zy share certain properties: 


(1) Both operations ate associative, that is, a + (b + c) = (a+ b) + cand a(bc) = (ab)c. 


(2) There is an additive ¿dentity 0 with the property that 0+-a = a+0 = a for all a. The corresponding 
multiplicative identity is the element 1;1-a=a-l=a. 


(3) For each a € Zp, there is a b such that a + b = 0; namely, b = —a has this property. By (A.3), 
the equation ax = 1 has an integer solution x := a”? for each a € Z;. Namely, since p is a 
prime, gcd(a, p) = 1. The elements —a and a”! are the additive and multiplicative ¿nverses of a, 
respectively. 


Structures with these three properties have turned out to be of such a great importance that they 
have a name; they are called groups. 


Formally, a group consists of a set G (finite or infinite) together with a binary operation * : Gx G — G 
called (group) multiplication. Note that “x : Gx G — G” means that G is closed under multiplication, 
that is, the product a * b is in G for any two elements a,b in G. A group must satisfy the following 
axioms: 


(G1) The operation x is associative, that is, a * (bx c) = (a * b) * c for any a,b,c E€ G. 

(G2) There exists an ¿dentity element e € G such that a * e = e x a = a for each element a € G. 

(G3) Each element a € G has an inverse b € G satisfying a * b = bx a = e = the identity. 

If, in addition, multiplication in G is commutative, that is, a x b = b x a for any two elements a,b € G, 


then the group is abelian. 


A group is usually identified with its underlying set, unless the group operation is not clear from 
context. From now on, we will suppress the group operation * and simply write ab instead of a * b. 
For n > 1, g” means multiplication of g with itself n times (for example, g? = ggg), while g~” is the 
inverse of g”. g° is the identity element. Note that g7g° = g**® for all integers a, b. 


A subgroup H of a group G is a group such that the set H is a subset of G. Any subset S of G generates 
a subgroup (S) of G consisting of all elements of the form 


aL An 
si sp", 
where s1,...,Sn ate (not necessarily distinct) elements in S and Q1,...,Q, ate (not necessarily 


positive) integers. If G = (g) for some g € G, then G is cyclic with generator g. This means that every 
element in G is of the form g” for some integer k. All cyclic groups are abelian. 


Examples 


e The set Z of integers is a cyclic group under addition with generator 1. However, the set of 


nonzero integers is 20f a group under multiplication. Namely, for a # +1, there is no integer b 
such that ab = 1. 


e The sets Q,R, and C of rational, real, and complex numbers are all abelian groups under 
addition. Moreover, Q*, R*, and C* (the above sets with 0 removed) are all abelian groups 
under multiplication. Namely, the inverse of a number z is 1/z. 


e The set Z,, is a cyclic group under addition. If n = ab is a composite number with a, b > 1, then 
the set {1,..., — 1} is not a group under multiplication modulo n. Namely, the product of a 
and b is equal to 0 modulo n, which implies that the set is not even closed under multiplication. 
However, the subset Z% is a group under multiplication. If n is a prime, then Z% is a cyclic 
group of order p — 1. 


The set Z under subtraction is not a group. Namely, subtraction is not associative; a — (b—c) 4 
(a — b) — c unless c= 0. 


e For a given set A, the set S4 of permutations (bijective functions) 7: A — A is a group under 
composition o. For example, composition is associative, because 


mo (po o)(a) = m(p(a(a))) = (ro p) o o(a). 


However, unless A consists of at most two elements, Sa is not abelian. For example, with 
A = Z3, (a) = a + 1, and o(a) = 2a, we have 


m o o(0) = (0(0)) = (0) = 1 42=0(1) = 0(1(0)) = 0 o (0). 


A.4 FIELDS AND RINGS 


One interesting observation from the examples in the previous section is that each of the sets Zp, 
R, Q, and C contains two different abelian group structures: the set itself under addition and the set 
of nonzero elements under multiplication. Structures satisfying this property together with an axiom 
about multiplication “distributing” over addition are called fields. 


Formally, a field consists of a set F together with two operations +: F x F —> Fand::FxF—-F 
called addition and multiplication, respectively, such that the following axioms are satisfied. 
(F1) F forms an abelian group under addition. 


(F2) F \ {0} forms an abelian group under multiplication, where 0 is the identity in the additive 
abelian group (F, +). 


(F3) Multiplication distributes over addition, that is,a-(b+c) =a-b+a-e. 


For an integer n and a field element x, n - x denotes the element obtained by adding x to itself n 
times; for example, 3-2 = x + x + x. The characteristic of a field is the smallest positive integer p 
such that p- 1 = 0. If no such p exists, then the characteristic of the field is defined to be 0. The 
characteristic of a field is either a prime number or 0. If the characteristic of a field is 0, then the field 
is infinite. However, a field with nonzero characteristic might be either finite or infinite. 


Examples The fields Q, R, and C of rational, real, and complex numbers, respectively, are fields 
of characteristic O. The finite field Z, is a field of characteristic p. 


The number of elements in a finite field must be a power a prime number. A classification theorem 
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of the finite fields states that there is exactly one finite field (up to ¿somorphism, see [Era98)) of size q 
for each prime power number q. Thus it makes sense talking about the field with q elements, which 
is traditionally denoted GF (q) (GF = Galois Field) or F}. 


A ring R satisfies axioms (F1) and (F3), but instead of (F2), multiplication in R is only required to be 
associative. If multiplication is commutative, then the ring is commutative. A nonzero element a in a 
ring is a gero divisor if there is a nonzero element b such that ab = 0. There are two main classes of 
commutative rings: 


e Rings with no zero divisors. All fields and the ring Z of integers are of this kind. 
e Rings with zero divisors. The ring Zn contains zero divisors if and only if n is composite. 
A polynomal in a ting R is a function f : R — R of the form 
f(a) = ao + aix + aga? +--+ + anz”, 


where ao, ..., an ate elements in the ring. A root of a polynomial is an element r such that f(r) = 0. 


A.5 VECTOR SPACES AND LATTICES 


In most undergraduate programs in mathematics the theory of linear algebra and vector spaces is 
introduced before the theory of groups and rings. This makes sense as vector spaces are easier to 
comprehend than groups and rings. The reason for putting this section after the preceding sections 
is simply that we now need fewer axioms to define a vector space. 


An n-dimensional vector space over the real numbers can be viewed as the set R” of n-tuples 
(a1,...,@n) of real numbers. The n-tuples are called vectors. There are two basic operations on such 
a set: addition and scalar multiplication. Addition of vectors is defined as 


(Mio ln) + (Biju nag bn) = (a1 + b1,- .., an + bn), 
while multiplication of a scalar (real number) r and a vector (a1, .. . an) is defined as 
r- (ai,..., an) = (rai, ..., Tan). 


Given a set S = {v1,.. . , Up } of vectors in R”, the /attice L(S) generated by S is the set of all integer 
combinations of elements in S. That is, L(S) contains all linear combinations 


N1U1 + +++ NKUL, 


where n1,...,n% € Z. For example, the set of all vectors in R? with integer coordinates is the lattice 
generated by the unit vectors (1,0, 0), (0,1, 0), and (0,0, 1). 

Formally, a vector space over the field F is a set V of elements called vectors together with an operation 
+: V x V — V called addition and an operation: : F x V —= V called scalar multiplication such that 


(V1) V is an abelian group under addition. 


(V2) a- (b-v) = (ab) - v for all a,be F,v € V. 


(V3) a-(u+w)=a-v+a-wforalla € F,v,we€ V. 


One may replace F with an arbitrary ring R, but then V is called a module over R and not a vector 
space. For example, a lattice is a module over Z. 


The elements v1,...,Ux E V ina vector space ate linearly dependent if there are elements a1,..., ax E€ F 
not all O such that 
01:01 +++ + ap: Uh =0. 


Otherwise the vectors v1,..., uz ate /inearly independent. The vectors e,,..., en form a basis for V if 
they are linearly independent and span V, that is, for each v € V, there are elements 41,..., dy € F 
such that 

a11 +*** + anen = 0. (A.4) 


If there is such a basis, then every basis has the same number of elements; this number is called the 
dimension of V. In fact, every vector space has a basis, but it need not be finite in general. 


With a fixed basis [e1,...,en) of the n-dimensional vector space V, any element v € V can be 
written as an n-tuple (a1, .. . , an), where a1,..., an ate the unique elements satisfying (A.4). Objects 
in the ordinary three-dimensional space such as planes and lines are easily generalized to arbitrary 
finite-dimensional vector spaces. For example, an affine hyperplane in V is the set of all points 
x= (11,..., Zn) satisfying the equation 


0121 +--+ + antn = b, 


where a1,...,an,b € F are some fixed constants (not all a; are zero). The word “affine” simply 
means that the element b does not have to be 0. The concept of a hyperplane generalizes the concept 
of an affine plane in R?, which has the form 


ax+by+cz=d 
for some constants a,b,c,d € R. A line in V is a set of the form {av + (1 — ajw : a € F}, where 
v,w E V are two different vectors. With w = 0, we obtain a line through the origin. 
A.6 BOOLEAN EXPRESSIONS 


Let B be a set with two elements, say B = {1,0} (B = {TRUE, FALSE} is another possibility). A 
boolean expression can be viewed as a function 


f: B” — B, 


where n is a nonnegative integer indicating the number of variables in the expression. A typical 
boolean expression is built up by a number of unary and binary operations. The most useful unary 
operation is — (negation), which is defined as =p = 1 — p. Some of the most important binary 
operations are A (AND), V (OR), $ (XOR), —, and =: 


¡ip al ona[pvalpoq|p>alp>q 


0 0 0 0 0 1 1 
0 1 0 1 1 1 0 
1 0 0 1 1 0 0 
1 1 1 1 0 1 1 
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For example, the boolean expression 


f(p,a,r) = (p> q) A (a — r) 


is equal to 1 (TRUE) if and only ifp <q <r. 


Another more general kind of boolean expression is a function 
f: B” => B”, 


where m is a positive integer. With m = n, we may identify a couple of useful operations. Note that 
an element in B” can be interpreted as the binary representation of an integer between 0 and 2” — 1. 
In this manner we may perform addition and multiplication modulo 2” as described in Section 
Another useful operation is rotation: For w = (w1,..., Wn) € B” and k an integer, let w << k mean 
that we rotate the content of w k steps to the left. For example, 


(wi, W2, W3, W4, W5, W6, w7) < 3 = (wa, W5, W6, W7, W1, W2, w3) 
Similarly, w > k means rotation of w k steps to the right. 


A.7 TIME ESTIMATIONS AND SOME COMPLEXITY THEORY 


Let N be the set of nonnegative integers and let R be the set of real numbers. In the study of 
functions f : N — R one often wants to estimate the approximate size of f(n) for large n. The 
concept of limits is very helpful for such instances. If f(n) approaches a real number c as n becomes 
larger, then f(n) is said to have the Amit c as n tends to infinity, denoted 


lim f(n) =c. 


Formally, f(n) has the limit c if for each e > 0 there is a number N such that | f(n) — c| < e for all 
n>N. 


Exam ples 


e With f(n) = 1/n, we have 


lim f(n)=0 
e One may prove that 
lim == 
whenever b > 1. 
e With f(n) = e” + n?, we have 
lim f(n) =o, 


that is, f(n) tends to infinity when n grows large. However, 


by the previous example. 


In the last example, the second limit told us more about the function f than the first. The procedure 
of estimating a complicated function f using an easier function g such as a polynomial or an 
exponential function is sometimes very useful. However, instead of trying to compute the exact limit 
of the fraction f(n)/g(n) as in the example, one might be content with only a rough estimate of the 
behavior of the fraction. The “big-O” notation has proven to be a very useful tool for this purpose. 
Formally, we say that f(n) is O(g(n)) if there is a constant C such that | f(n)| < C|g(n)| for all n (or 
at least for all n larger than some integer N). This means that the fraction f(n)/g(n) is bounded by 
the constant C. For example, f(n) = 7sinn e” is O(e”), because 


| f(m)| < Te”. 


Say that we have an algorithm (a procedure taking an input and producing an output following certain 
rules) and let f(n) denote the maximal time needed to produce the output, where n is the size of the 
input. For instance, if the input is an integer, then n is normally the number of digits in the binary 
representation of the integer. 


We say that the algorithm is a polynomial time algorithm in n if there is an integer k such that f(n) is 
O(n*). We say that an algorithm is sub-exponential if f(n) is O(a”) for all a > 1. All polynomial 
time algorithms are sub-exponential, but there are sub-exponential time algorithms that are not 
polynomial. For example, with f(n) = ev” we have 


f(n)/a” — evr=nloga +0 


and 
f(n)/n* — eVr—klogn 300 


when n tends to infinity. The algorithm is an exponential time algorithm if it is not sub-exponential and if 

it is O(b") for some b > 1. There are algorithms that are even slower than exponential (for example, 
: 2 : . į : . 

consider f(n) = e”). However, in most applications the “worst” algorithms are at most exponential. 
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In this glossary brief descriptions of the most important concepts are given. Mathematical concepts 
are treated in more detail in Appendix [Al For more information, we refer to earlier chapters in this 
document. 


abelian group An abstract group with a commutative binary operation; see Section 


adaptive-chosen-ciphertext A version of the chosen-ciphertext attack where the cryptanalyst can choose 
ciphertexts dynamically. A cryptanalyst can mount an attack of this type in a scenario in which he or she 


has free use of a piece of decryption hardware, but is unable to extract the decryption key from it. 


adaptive-chosen-plaintext A special case of the chosen-plaintext attack in which the cryptanalyst is able to 


choose plaintexts dynamically, and alter his or her choices based on the results of previous encryptions. 


adversary Commonly used to refer to the opponent, the enemy, or any other mischievous person that desires to 


compromise one’s security. 


AES The Advanced Encryption Standard that will replace DES (The Data Encryption Standard) around the turn 
of the century. 


algebraic attack A method of cryptanalytic attack used against block ciphers that exhibit a significant amount of 


mathematical structure. 
algorithm A series of steps used to complete a task. 
Alice The name traditionally used for the first user of cryptography in a system; Bob’s friend. 
ANSI American National Standards Institute. 
API Application Programming Interface. 


attack Either a successful or unsuccessful attempt at breaking part or all of a cryptosystem. See algebraic 
attack, birthday attack, brute force attack, chosen ciphertext attack, chosen plaintext attack, differential 
ctyptanalysis, known plaintext attack, linear cryptanalysis, middleperson attack. 


authentication The action of verifying information such as identity, ownership or authorization. 


big-O notation Used in complexity theory to quantify the long-term time dependence of an algorithm with 
respect to the size of the input. See Section A7} 


biometrics The science of using biological properties to identify individuals; for example, finger prints, a retina 


scan, and voice recognition. 


birthday attack A brute-force attack used to find collisions. It gets its name from the surprising result that the 
probability of two or mote people in a group of 23 sharing the same birthday is greater than 1/2. 


bit A binary digit, either 1 or 0. 


blind signature scheme Allows one party to have a second party sign a message without revealing any (or very 


little) information about the message to the second party. 
block A sequence of bits of fixed length; longer sequences of bits can be broken down into blocks. 


block cipher A symmetric cipher which encrypts a message by breaking it down into blocks and encrypting each 
block. 


block cipher based MAC MAC that is performed by using a block cipher as a keyed compression function. 
Bob The name traditionally used for the second user of cryptography in a system; Alice’s friend. 


boolean expression A mathematical expression in which all variables involved are either O or 1; it evaluates to 
either 0 or 1. See Section AG. 


brute force attack This attack requires trying all (or a large fraction of all) possible values till the right value is 


found; also called an exhaustive search. 
CA See certifying authority 
CAPI Cryptographic Application Programming Interface. 


Capstone The U.S. government’s project to develop a set of standards for publicly available cryptography, as 
authorized by the Computer Security Act of 1987. See Clipper, DSA, DSS, and Skipjack. 


certificate In cryptography, an electronic document binding some pieces of information together, such as a uset’s 
identity and public-key. Certifying Authorities (CA’s) provide certificates. 


certificate revocation list A list of certificates that have been revoked before their expiration date. 
Certifying Authority (CA) A person or organization that creates certificates. 


checksum Used in error detection, a checksum is a computation done on the message and transmitted with the 


message; similar to using parity bits. 
chosen ciphertext attack An attack where the cryptanalyst may choose the ciphertext to be decrypted. 


chosen plaintext attack A form of cryptanalysis where the cryptanalyst may choose the plaintext to be 
encrypted. 


cipher An encryption-decryption algorithm. 
ciphertext Encrypted data. 


ciphertext-only attack A form of cryptanalysis where the cryptanalyst has some ciphertext but nothing else. 
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Clipper Clipper is an encryption chip developed and sponsored by the U.S. government as part of the Capstone 


project. 
collision Two values x and y form a collision of a (supposedly) one-way function F if x 4 y but F(x) = F(y). 


collision-free A hash function is collision-free if collisions are hard to find. The function is weakly collision-free 
if it is computationally hard to find a collision for a given message x. That is, it is computationally 
infeasible to find a message y # x such that H(x) = H(y). A hash function is strongly collision-free if 
it is computationally infeasible to find any messages £, y such that x # y and H(x) = H(y). 


collision search The search for a collision of a one-way function. 


commutative When a mathematical operation yields the same result regardless of the order the objects are 
operated on. For example, if a,b are integers, then a + b = b+ a, that is, addition of integers is 


commutative. 


computational complexity Refers to the amount of space (memory) and time required to solve a problem. 
Space refers to spatial (memory) constraints involved in a certain computation, while time refers to the 


temporal constraints involved in the computation. 


compression function A function that takes a fixed length input and returns a shorter, fixed length output. See 


also hash functions. 
compromise The unintended disclosure or discovery of a cryptographic key or secret. 


concatenate To place two (or more) things together one directly after the other. For example, treehouse is the 


concatenation of the words tree and house. 
covert channel A hidden communication medium. 
CRL Certificate Revocation List. 
cryptanalysis The art and science of breaking encryption or any form of cryptography. See attack. 


cryptography The art and science of using mathematics to secure information and create a high degree of trust 


in the electronic realm. See also public key, secret key, symmetric-key, and threshold cryptography. 
ctyptology The branch of mathematics concerned with cryptography and cryptanalysis. 


cryptosystem An encryption-decryption algorithm (cipher), together with all possible plaintexts, ciphertexts and 
keys. 


Data Encryption Standard See DES. 
decryption The inverse (reverse) of encryption. 


DES Data Encryption Standard, a block cipher developed by IBM and the U.S. government in the 1970’s as an 
official standard. See also block cipher. 


dictionary attack A brute force attack that tries passwords and or keys from a precompiled list of values. This is 


often done as a precomputation attack. 


Diffie-Hellman key exchange A key exchange protocol allowing the participants to agree on a key over an 


insecure channel. 


differential cryptanalysis A chosen plaintext attack relying on the analysis of the evolution of the differences 
between two plaintexts. 


digest Commonly used to refer to the output of a hash function, e.g. message digest refers to the hash of a 


message. 
digital cash See electronic money 


digital envelope A key exchange protocol that uses a public-key cryptosystem to encrypt a secret key for a 
sectet-key cryptosystem. 


digital fingerprint See digital signature. 
digital signature The encryption of a message digest with a private key. 
digital timestamp A record mathematically linking a document to a time and date. 


discrete logarithm Given two elements d, g in a group such that there is an integer r satisfying g” = d, r is 
called the discrete logarithm of d in the “base” g. 


discrete logarithm problem The problem of finding r such that g” = d, where d and g are elements in a 


given group. For some groups, the discrete logarithm problem is a hard problem used in public-key 


cryptography. 


distributed key A key that is split up into many parts and shared (distributed) among different participants. See 


also secret sharing. 
DMS Defense Messaging Service. 
DOD Department of Defense. 
DSA Digital Signature Algorithm. DSA is a public-key method based on the discrete logarithm problem. 
DSS Digital Signature Standard. DSA is the Digital Signature Standard. 
EAR Export Administration Regulations. 
ECC Elliptic Curve Cryptosystem; A public-key cryptosystem based on the properties of elliptic curves. 
ECDL See elliptic curve discrete logarithm. 
EDI Electronic (business) Data Interchange. 
electronic commerce (e-commerce) Business transactions conducted over the Internet. 
electronic mail (e-mail) Messages sent electronically from one person to another via the Internet. 
electronic money Electronic mathematical representation of money. 
elliptic curve The set of points (x, y) satisfying an equation of the form 
yY? =r? +ar+b 


for variables x, y and constants a,b € F, where F is a field. 
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elliptic curve cryptosystem See ECC. 


elliptic curve discrete logarithm (ECDL) problem The problem of finding m such that m: P = Q, where 


P and Q ate two points on an elliptic curve. 


elliptic curve (factoring) method A special-purpose factoring algorithm that attempts to find a prime factor 
p of an integer n by finding an elliptic curve whose number of points modulo p is divisible by only small 


primes. 


encryption The transformation of plaintext into an apparently less readable form (called ciphertext) through a 
mathematical process. The ciphertext may be read by anyone who has the key that decrypts (undoes the 
encryption) the ciphertext. 


exclusive-OR See XOR. 
exhaustive search Checking every possibility individually till the right value is found. See also attack. 
expiration date Certificates and keys may have a limited lifetime; expiration dates are used to monitor this. 


exponential function A function where the variable is in the exponent of some base, for example, b” where x 


is the variable, and b > 0 is some constant. 


exponential running time A running time of an algorithm that is approximately exponential as a function of 
the length of the input. 


export encryption Encryption, in any form, which leaves its country of origin. For example, encrypted 


information or a computer disk holding encryption algorithms that is sent out of the country. 


factor Given an integer n, any number that divides it is called a factor of n. For example, 7 is a factor of 91, 


because 91/7 is an integer. 
factoring The breaking down of an integer into its prime factors. This is a hard problem. 


factoring methods See elliptic curve method, multiple polynomial quadratic sieve, number field sieve, Pollard 
p — 1 and Pollard p + 1 method, Pollard rho method, quadratic sieve. 


FBI Federal Bureau of Investigation, a U.S. government law enforcement agency. 


Feistel cipher A special class of iterated block ciphers where the ciphertext is calculated from the plaintext by 


repeated application of the same transformation called a round function. 


field A mathematical structure consisting of a finite or infinite set F together with two binary operations called 
addition and multiplication. Typical examples include the set of real numbers, the set of rational numbers, 


and the set of integers modulo p. See Section [A.4 for more detailed information. 
FIPS Federal Information Processing Standards. See NIST. 
flat key space See linear key space. 


function A mathematical relationship between two values called the input and the output, such that for each input 


there is precisely one output. For example, f defined on the set of real numbers as f(x) = x? is a 


function with input any real number x and with output the square of x. 


Galois field A field with a finite number of elements. The size of a finite field must be a power of prime number. 
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generalpurpose factoring algorithm An algorithm whose running time depends only on the size of the 


number being factored. See special purpose factoring algorithm. 
Goppa code A class of error correcting codes, used in the McEliece public-key cryptosystem. 


graph In mathematics, a set of elements called vertices or nodes together with a set of unordered pairs of vertices 


called edges. Intuitively speaking, an edge is a line joining two vertices. 


graph coloring problem The problem of determining whether a graph can be colored with a fixed set of colors 
such that no two adjacent vertices have the same color and producing such a coloring. Two vertices are 


adjacent if they are joined by an edge. 


group A mathematical structure consisting of a finite or infinite set together with a binary operation called group 


multiplication satisfying certain axioms; see Section [1.3] for more information. 
GSS-API genetic security service application program interface. 
hacker A person who tries and/or succeeds at defeating computer security measures. 


Hamiltonian path problem Determine whether a given graph contains a Hamiltonian graph. A Hamiltonian 
path is a path in a graph that passes through each vertex exactly once. This is a hard problem. 


handshake A protocol two computers use to initiate a communication session. 

hard problem A computationally-intensive problem; a problem that is computationally difficult to solve. 
hash-based MAC MAC that uses a hash function to reduce the size of the data it processes. 

hash function A function that takes a variable sized input and has a fixed size output. 

HMAC see MAC. 


hyperplane A mathematical object which may be thought of as an extension (into higher dimensions) of 


a 2-dimensional plane passing through the point (0,0,0) in a 3-dimensional vector space. See 


Appendix [Al 
IEEE Institute of Electrical and Electronics Engineers, a body that creates some cryptography standards. 
IETF Internet Engineering Task Force. 
identification A process through which one ascertains the identity of another person or entity. 
iKP Internet Keyed Payments Protocol. 
impersonation Occurs when an entity pretends to be someone or something it is not. 
import encryption Encryption, in any form, coming into a country. 
index calculus A method used to solve the discrete logarithm problem. 


integer programming problem The problem is to solve a linear programming problem where the variables 


are restricted to integers. 
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interactive proof A protocol between two parties in which one party, called the prover, tries to prove a certain 
fact to the other party, called the verifier. This is usually done in a question response format, where the 


verifier asks the prover questions that only the prover can answer with a certain success rate. 
Internet The connection of computer networks from all over the world forming a worldwide network. 
intractable In complexity theory, referring to a problem with no efficient means of deriving a solution. 
ISO International Standards Organization, creates international standards, including cryptography standards. 
ITU-T International Telecommunications Union -- Telecommunications standardization sector. 
Kerberos An authentication service developed by the Project Athena team at MIT. 


key A string of bits used widely in cryptography, allowing people to encrypt and decrypt data; a key can be used 
to perform other mathematical operations as well. Given a cipher, a key determines the mapping of 
the plaintext to the ciphertext. See also distributed key, private key, public key, secret key, session key, 
shared key, sub key, symmetric key, weak key. 


key agreement A process used by two or more parties to agree upon a secret symmetric key. 
key escrow The process of having a third party hold onto encryption keys. 

key exchange A process used by two mote patties to exchange keys in cryptosystems. 

key expansion A process that creates a larger key from the original key. 

key generation The act of creating a key. 


key management The various processes that deal with the creation, distribution, authentication, and storage of 
keys. 


key pair The full key information in a public-key cryptosystem, consisting of the public key and private key. 


key recovery A special feature of a key management scheme that allows messages to be decrypted even if the 


original key is lost. 
key schedule An algorithm that generates the subkeys in a block cipher. 


key space The collection of all possible keys for a given cryptosystem. See also flat key space, linear key space, 


nonlinear key space, and reduced key space. 


knapsack problem A problem that involves selecting a number of objects with given weights from a set, such 
that the sum of the weights is maximal but less than a pre-specified weight. 


known plaintext attack A form of cryptanalysis where the cryptanalyst knows both the plaintext and the 


associated ciphertext. 
lattice A lattice can be viewed as a grid in an n-dimensional vector space. See Section A3 
LEAF Law Enforcement Agency Field a component in the Clipper Chip. 


life cycle The length of time a key can be kept in use and still provide an appropriate level of security. 


linear complexity Referring to a sequence of 0's and 1’s, the size of the smallest linear feedback shift register 
(LFSR) that would replicate the sequence. See also linear feedback shift register. 


linear cryptanalysis A known plaintext attack that uses linear approximations to describe the behavior of the 
block cipher. See known plaintext attack. 


linear key space A key space where each key is equally strong. 


LFSR linear feedback shift register. Used in many keystream generators because of its ability to produce sequences 


with certain desirable properties. 
MAC See message authentication code. 


meet-in-the-middle attack A known plaintext attack against double encryption with two separate keys where 
the attacker encrypts a plaintext with a key and “decrypts” the original ciphertext with another key and 


hopes to get the same value. 


Message Authentication Code(MAC) A MAC is a function that takes a variable length input and a key to 
produce a fixed-length output. See also hash-based MAC, stream-cipher based MAC, and block-cipher 
based MAC. 


message digest The result of applying a hash function to a message. 

MHS Message Handling System. 

middleperson attack A person who intercepts keys and impersonates the intended recipients. 
MIME Multipurpose Internet Mail Extensions. 

MIPS Millions of Instructions Per Second, a measurement of computing speed. 

MIPS-Year One year’s worth of time on a MIPS machine. 


mixed integer programming The problem is to solve a linear programming problem where some of the 


variables are restricted to being integers. 


modular arithmetic A form of arithmetic where integers are considered equal if they leave the same remainder 
when divided by the modulus. See Section AJ 


modulus The integer used to divide out by in modular arithmetic. 


multiple polynomial quadratic sieve(MPQS) A variation of the quadratic sieve that sieves on multiple 
polynomials to find the desired relations. MPQS was used to factor RSA-129. 


NIST National Institute of Standards and Technology, a United States agency that produces security and 
ctyptography related standards (as well as others); these standards are published as FIPS documents. 


non-repudiation A property of a cryptosystem. Non-repudiation cryptosystems are those in which the users 


cannot deny actions they performed. 
nondeterministic Not determined or decided by previous information. 


nondeterministic computer Currently only a theoretical computer capable of performing many computations 


simultaneously. 
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nonlinear key space A key space comprised of strong and weak keys. 


NP Nondeterministic polynomial running time. If the running time, given as a function of the length of the input, 
is a polynomial function when running on a theoretical, nondeterministic computer, then the algorithm 
is said to be NP. 


NP-complete An NP problem is NP-complete if any other NP problem can be reduced to it in polynomial time. 


NSA National Security Agency. A security-conscious U. S. government agency whose mission is to decipher and 


monitor foreign communications. 


number field sieve A method of factoring, currently the fastest general purpose factoring algorithm published. 
It was used to factor RSA-130. 


number theory A branch of mathematics that investigates the relationships and properties of numbers. 
OAEP Optimal Asymmetric Encryption Padding; a provably secure way of encrypting a message. 


one-time pad A secret-key cipher in which the key is a truly random sequence of bits that is as long as the 
message itself, and encryption is performed by XORing the message with the key. This is theoretically 


unbreakable. 


one-way function A function that is easy to compute in one direction but quite difficult to reverse compute 


(compute in the opposite direction.) 
one-way hash function A one-way function that takes a variable sized input and creates a fixed size output. 


P Polynomial running time. If the running time, given as a function of the length of the input is bounded by a 
polynomial, the algorithm is said to have polynomial running time. Polynomial running time algorithms 


are sub-exponential, but not all sub-exponential algorithms are polynomial running time; one example is 


eve 
patent The sole right, granted by the government, to sell, use, and manufacture an invention or creation. 
PKI Public-key Infrastructure. PKIs are designed to solve the key management problem. See also key management. 
padding Extra bits concatenated with a key, password, or plaintext. 
password A character string used as a key to control access to files or encrypt them. 


PKCS Public-key cryptography Standards. A series of cryptographic standards dealing with public-key issues, 
published by RSA Laboratories. 


plaintext The data to be encrypted. 


plane A geometric object in 3-dimensional space defined by an equation of the form Ar + By + Cz = D 
(A, B,C notall 0), that is, the plane contains every point (x, y, z) satisfying this equation. For example, 
z = 0 gives the (x, y) plane. 


Pollard p — 1 and Pollard p+ 1 methods Algorithms that attempt to find a prime factor p of a number n by 


exploiting properties of p — 1 and p + 1, respectively. See also factoring, prime factor, prime number. 


Pollard Rho method A method for solving the discrete logarithm and elliptic curve discrete logarithm. 


polynomial An algebraic expression written as a sum of constants multiplied by different powers of a variable, 


that is, an expression of the form 
-1 
Ant” + Ap-12" +- + az + ao, 


where the aj are the constants and z is the variable. A polynomial can be interpreted as a function with 


input value x. 


precomputation attack An attack where the adversary precomputes a look-up table of values used to crack 


encryption or passwords. See also dictionary attack. 


primality testing A test that determines, with varying degree of probability, whether or not a particular number 


is prime. 
prime factor A prime number that is a factor of another number is called a prime factor of that number. 


prime number Any integer greater than 1 that is divisible only by 1 and itself. The first twelve primes are 
2,3,5,7,11,13,17,19,23,29,31, and 37. 


privacy The state or quality of being secluded from the view and or presence of others. 
private exponent The private key in the RSA public-key cryptosystem. 


private key In public-key cryptography, this key is the secret key. It is primarily used for decryption but is also 


used for encryption with digital signatures. 


proactive security A property of a cryptographic protocol or structure which minimizes potential security 


compromises by refreshing a shared key or secret. 
probabilistic signature scheme (PSS) A provably secure way of creating signatures using the RSA algorithm. 
protocol A series of steps that two or more parties agree upon to complete a task. 


provably secure A property of a digital signature scheme stating that it is provably secure if its security can be 
tied closely to that of the cryptosystem involved. See also digital signature scheme. 


pseudo-random number A number extracted from a pseudo-random sequence. 


pseudo-random sequence A deterministic function which produces a sequence of bits with qualities similar to 


that of a truly random sequence. 
PSS Sce probabilistic signature scheme. 
public exponent The public key in the RSA public-key cryptosystem. 


public key In public-key cryptography this key is made public to all, it is primarily used for encryption but can be 


used for verifying signatures. 
public-key cryptography Cryptography based on methods involving a public key and a private key. 
quadratic sieve A method of factoring an integer, developed by Carl Pomerance. 


quantum computer A theoretical computer based on ideas from quantum theory; theoretically it is capable of 


operating nondeterministically. 
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RSA algorithm A public-key cryptosystem based on the factoring problem. RSA stands for Rivest, Shamir and 
Adleman, the developers of the RSA public-key cryptosystem and the founders of RSA Data Security 
(now RSA Security). 


random number As opposed to a pseudo-random number, a truly random number is a number produced 
independently of its generating criteria. For cryptographic purposes, numbers based on physical 


measurements, such as a Geiger counter, are considered random. 


reduced key space When using an n bit key, some implementations may only use r < n bits of the key; the 


result is a smaller (reduced) key space. 


relatively prime Two integers are relatively prime if they have no common factors. For example, 14 and 25 are 


relatively prime, while 14 and 91 are not; 7 is a common factor. 
reverse engineer To ascertain the functional basis of something by taking it apart and studying how it works. 
rounds The number of times a function, called a round function, is applied to a block in a Feistel cipher. 


running time A measurement of the time required for a particular algorithm to run as a function of the input 
size. See also exponential running time, nondeterministic polynomial running time, polynomial running 


time, sub-exponential running time, and Section A. 


S-HTTP Secure HyperText Transfer Protocol, a secure way of transferring information over the World Wide 
Web. 


S/MIME Secure Multipurpose Internet Mail Extensions. 
SSL Secure Socket Layer. A protocol used for secure Internet communications. 
SWIFT Society for Worldwide Interbank Financial Telecommunications. 


salt A string of random (or pseudo-random) bits concatenated with a key or password to foil precomputation 


attacks. 


satisfiability problem Given a Boolean expression determine if there is an assignment of 1’s and 0’s such that 


the expression evaluates to 1. This is hard problem. 
secret key In secret-key cryptography, this is the key used both for encryption and decryption. 


secret sharing Splitting a secret (e.g. a private key) into many pieces such that any specified subset of k pieces 


may be combined to form the secret, but k — 1 pieces are not enough. 
secure channel A communication medium safe from the threat of eavesdroppers. 
seed A typically random bit sequence used to generate another, usually longer pseudo-random bit sequence. 
self-shrinking generator A stream cipher where the output of an LFSR is allowed to feed back into itself. 
self-synchronous Referring to a stream cipher, when the keystream is dependent on the data and its encryption. 


session key A key for symmettic-key cryptosystems which is used for the duration of one message or communi- 


cation session 


SET Secure Electronic Transaction. MasterCard and Visa developed (with some help from industry) this standard 


jointly to insure secure electronic transactions. 
shared key The secret key two (or more) users share in a symmetric-key cryptosystem. 


shrinking generator A stream cipher built around the interaction of the outputs of two LFSRs. See also stream 
cipher and linear feedback shift register. 


Skipjack The block cipher contained in the Clipper chip designed by the NSA. 
SMTP Simple Mail Transfer Protocol. 


smart card A card, not much bigger than a credit card, that contains a computer chip and is used to store or 


process information. 


special-purpose factoring algorithm A factoring algorithm which is efficient or effective only for some 


numbers. See also factoring and prime factors. 


standards Conditions and protocols set forth to allow uniformity within communications and virtually all 


computer activity. 
stream cipher A secret-key encryption algorithm that operates on a bit at a time. 


stream cipher based MAC MAC that uses linear feedback shift registers (LFSRs) to reduce the size of the data 


it processes. 
strong prime A prime number with certain properties chosen to defend against specific factoring techniques. 


sub-exponential running time The running time is less than exponential. Polynomial running time algorithms 


are sub-exponential, but not all sub-exponential algorithms are polynomial running time. 
subkey A value generated during the key scheduling of the key used during a round in a block cipher. 


subset sum problem A problem where one is given a set of numbers and needs to find a subset that sums to a 


particular value. 
S/WAN Secure Wide Area Network. 
symmetric cipher An encryption algorithm that uses the same key is used for encryption as decryption. 
symmetric key See secret key. 


synchronous A property of a stream cipher, stating that the keystream is generated independently of the plaintext 


and ciphertext. 


tamper resistant In cryptographic terms, this usually refers to a hardware device that is either impossible or 


extremely difficult to reverse engineer or extract information from. 
TCSEC Trusted Computer System Evaluation Criteria. 


threshold cryptography Splitting a secret (for example a private key) into many pieces such that only certain 


subsets of the n pieces may be combined to form the secret. 


timestamp See digital timestamp 
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tractable A property of a problem, stating that it can be solved in a reasonable amount of time using a reasonable 


amount of space. 


trapdoor one-way function A one-way function that has an easy-to-compute inverse if you know certain secret 


information. This secret information is called the trapdoor. 


traveling salesman problem A hard problem. The problem is: given a set of cities, how does one tour all the 


cities in the minimal amount of distance traveled. 
trustees Á common term for escrow agents. 
Turing machine A theoretical model of a computing device, devised by Alan Turing. 
verification The act of recognizing that a person or entity is who or what it claims to be. 
Vernam cipher See one-time pad. 


weak key A key giving a poor level in security, or causing regularities in encryption which can be used by 


ctyptanalysts to break codes. 
WWW World Wide Web. 


XOR A binary bitwise operator yielding the result one if the two values are different and zero otherwise. XOR is 
an abbreviation for exclusive-OR. 


zero knowledge proofs An interactive proof where the prover proves to the verifier that he or she knows 


certain information without revealing the information. 
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